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NOVEL STREPTOCOCCUS ANTIGENS 
FIELD OF THE INVENTION 

5 The present invention is related to antigens, more 
particularly protein antigens of streptococcus 
pneumoniaepathogen which are useful as vaccine components 
for therapy and/or prophylaxis. 

10 BACKGROUND OF THE INVENTION 

pneumoniae is an important agent of disease in man 
especially among infants, the elderly and immunocompromised 
persons. It is a bacterium frequently isolated from 
patients with invasive diseases such as 

bacteraemia/septicaemia, pneumonia, meningitis with high 
morbidity and mortality throughout the world. Even with 
appropriate antibiotic therapy, pneumococcal infections 
still result in many deaths. Although the advent of 
antimicrobial drugs has reduced the overall mortality from 
pneumococcal disease, the presence of resistant pneumococcal 
organisms has become a major problem in the world today. 
Effective pneumococcal vaccines could have a major impact on 
the morbidity and mortality associated with pneumoniae 
disease. Such vaccines would also potentially be useful to 
prevent otitis media in infants and young children. 

Efforts to develop a pneumococcal vaccine have generally 
concentrated on generating immune responses to the 
pneumococcal capsular polysaccharide. More than 80 
pneumococcal capsular serotypes have been identified on the 
basis of antigenic differences. The currently available 
pneumococcal vaccine, comprising 23 capsular polysaccharides 
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that most frequently caused disease, has significant 
shortcomings related primarily to the poor immunogenicity of 
some capsular polysaccharides, the diversity of the 
serotypes and the differences in the distribution of 
serotypes over time, geographic areas and age groups. in 
particular, the failure of existing vaccines and capsular 
conjugate vaccines currently in development to protect young 
children against all serotypes spurres evaluation of other 

pneumoniae components. Although immunogenicity of 
capsular polysaccharides can be improved, serotype 
specificity will still represent a major limitation of 
polysaccharide-based vaccines. The use of a antigenically 
conserved immunogenic pneumococcal protein antigen, either 
by itself or in combination with additional components, 
offers the possibility of a protein-based pneumococcal 
vaccine. 



PCT Publication number WO98/18930 published may 7 1998 
entitled « Streptococcus Pneumoniae antigens and vaccines" 
20 describes certain polypeptides which are claimed to be 
antigenic. However, no biological activity of these 
polypeptides is reported. 

Therefore their remains an unmet need for Streptococcus 
25 antigens that may be used as vaccine components for the 
prophylaxis and/or therapy of Streptococcus infection. 
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SUMMARY OF THE INVENTION 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOst 2, 4, 6, 8, 10, 14, 16, 55 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

In other aspects, there are provided vectors comprising 
5 polynucleotides of the invention operably linked to an 
expression control region, as well as host cells 
trans fee ted with said vectors and methods of producing 
polypeptides comprising culturing said host cells under 
conditions suitable for expression. 

10 

In yet another aspect, there are provided novel 
polypeptides encoded by polynucleotides of the invention. 



15 BRIEF DE SCRIPTION OP THE DRAWINGS 

Figure 1 is the DNA sequence of BVH-3 gene; SEQ ID NO: 1. 

Figure 2 is the amino acid sequence of BVH-3 protein; SEQ ID 
20 NO: 2. 

Figure 3 is the DNA sequence of BVH-11 gene; SEQ ID NO: 3. 

Figure 4 is the amino acid sequence of BVH-11 protein; SEQ 
25 ID NO: 4. 

Figure 5 is the DNA sequence of BVH-28 gene; SEQ ID NO: 5. 

Figure 6 is the amino acid sequence of BVH-28 protein; SEQ 
30 ID NO: 6. 

1 

Figure 7 is the DNA sequence of BVH-3A gene which 
corresponds to the 5' terminal end of BVH-3; SEQ ID NO: 7. 
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Figure 8 is the amino acid sequence of BVH-3A protein; SEQ 
ID NOt 8. 

Figure 9 is the DNA sequence of BVH-3B gene which 
corresponds to the 3' terminal end of BVH-3; SEQ ID NO: 9. 

Figure 10 is the amino acid sequence of BVH-3B protein; SEQ 
ID NOi 10. 



Figure 11 depicts the comparison of the predicted amino 
acid sequences of the BVH-3 open reading frames from WU2, 
RX1, JNR.7/87, SP64, P4241 and A66 pneumoniae strains' 
by using the program Clustal W from MacVector sequence 
analysis software (version 6.5). Underneath the alignment, 
there is a consensus' line where * and . characters indicate 
identical and similar amino acid residues, respectively. 

Figure 12 depicts the comparison of the predicted amino 
acid sequences of the BVH-11 open reading frames from WU2, 
Rxl, JNR.7/87, SP64, P4241, A66 and SP63 L pneumonia 
strains by using the program Clustal W from MacVector 
sequence analysis software (version 6.5). Underneath the 
alignment, there is a consensus line where * and 
characters indicate identical and similar amino acid 
residues, respectively. 



Figure 13 depicts the comparison of the predicted amino 
acid sequences of the BVH-11 proteins from various S,. 
30 pneumoniae strains. The degrees of identity (I) and 

similarity (S) were determined by using the program Clustal 
W from MacVector sequence analysis software (version 6.5). 



Figure 14 is a DNA sequence containing the complete BVH-3 
gene (open reading frame "ORF" at nucleotides 1777 to 
4896) ; SEQ ID NO: 11. 



4 



WO 00/39299 



PCT/CA99/01218 



Figure 15 is a DNA sequence containing the complete BVH-11 
gene (ORP at nucleotides 45 to 2567); SEQ id NOi 12. 

5 Figure 16 is a DNA sequence containing the complete BVH-11- 
2 gene (ORF at nucleotides 114 to 2630); SEQ ID NO: 13. 

Figure 17 is the amino acid sequence of BVH-11-2 protein; 
SEQ ID NO: 14. 

10 

Figure 18 is the DNA sequence of SP63 BVH-3 gene; SEQ ID 
NO: 15. 



Figure 19 is the amino acid 
15 SEQ ID NO: 16. 

Figure 20 is the amino acid 
ID NO: 55. 

20 Figure 21 is the amino acid 
SEQ ID NO: 56. 

Figure 22 is the amino acid 
SEQ ID NO: 57. 

25 

Figure 23 is the amino acid 
ID NO: 58. 

Figure 24 is the amino acid 
30 ID NO i 59. 

Figure 25 is the amino acid 
SEQ ID NO: 60. 

35 Figure 26 is the amino acid 
SEQ ID NO: 61. 



sequence of SP63 BVH-3 protein; 



sequence of BVH-3M protein; SEQ 



sequence of BVH-3 AD protein; 



sequence of L-BVH-3-AD protein; 



sequence of NEW12 protein; SEQ 



sequence of BVH-3C protein; SEQ 



sequence of BVH-11M protein; 



sequence of BVH-11A protein; 
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Figure 27 is the amino acid sequence of BVH-11B (also 
called Newl3) protein; SEQ id NO: 62. 

5 Figure 28 is the amino acid sequence of BVH-11C protein; 
SEQ ID NO: 63. 



Figure 29 is the amino acid sequence of NEW1 protein; SEQ 
ID NO: 64. 
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Figure 30 is the amino acid sequence of NEW2 protein; SEQ 
XD NO: 65. 

Figure 31 is the amino acid sequence of NEW3 protein; SEQ 
15 ID NO: 66. 

Figure 32 is the amino acid sequence of NEW4 protein- SEQ 
ID NO: 67. 

20 Figure 33 is the amino acid sequence of NEWS protein; SEQ 
ID NO: 68. 

Figure 34 is the amino acid sequence of NEW6 protein; SEQ 
ID NO: 69. 

25 

Figure 35 is the amino acid sequence of NEW7 protein; SEQ 
ID NO: 70. 

Figure 36 is the amino acid sequence of NEWS protein; SEQ 
30 ID NO: 71. 

Figure 37 is the amino acid sequence of NEW9 protein; SEQ 
ID NO: 72. 

35 Figure 38 is the amino acid sequence of BVH-11-2M protein- 
SEQ ID NO: 73. 
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Figure 39 is the amino acid sequence of NEW10 protein; SEQ 
ID NO: 74. 

5 Figure 40 is the amino acid sequence of NEW11 protein; SEQ 
ID NO: 75. 



Figure 41 is the DNA sequence of NEW12 gene; SEQ ID NO: 76. 

10 Figure 42 is the amino acid sequence of NEW14 protein; SEQ 
ID NO: 77. 

Figure 43 is the amino acid sequence of NEW15 protein; SEQ 
ID NO: 78. 
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Figure 44 is the amino acid sequence of NEW16 protein; SEQ 
ID NO: 79. 



Figure 45 is the DNA sequence of GBS BVH-71 gene; SEQ ID 
20 NO: 80. 



Figure 46 is the amino acid sequence of GBS BVH-71 
protein; SEQ ID NO: 81. 

25 Figure 47 is the DNA sequence of GAS BVH-71 gene; SEQ ID 
NO:82. 

Figure 48 is the amino acid sequence of GAS BVH-71 protein; 
SEQ ID NO: 83. 

30 

DETAILED DBfifPTPTIQN OP THE IMVEWTTflM 
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According to one aspect, the present invention provides 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
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to 75, 77 to 79, 81, 83 or fragments, analogs or. 
derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 95% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 
75, 77 to 79 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 
58, 59, 64, 65, 66, 78 or fragments, analogs or derivatives 
thereof . 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
5 sequence chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 
59, 64, 65, 66, 78 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs; 4, 14, 58, 60, 61, 62, 63, 

67, 68, 69, 70, 71, 72, 73, 74, 75, 77, 79 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 7 0% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 4, 14, 60, 61, 62, 63, 67, 

68, 69, 70, 71, 72, 73, 74, 75, 77, 79 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
25 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 
75, 77 to 79 or fragments, analogs or derivatives thereof. 

30 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least; 70% identity to a second polypeptide comprising 
sequence chosen from SEQ ID NOs : 10, 55 to 75, 77, 78, 79 
or fragments, analogs or derivatives thereof. 

35 

According to one aspect, the present invention provides an 
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isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence chosen from SEQ ID NOs; 55 to 75, 77, 78, 79 or 
fragments, analogs or derivatives thereof. 

5 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or 
10 fragments, analogs or derivatives thereof. 



According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
15 least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
20 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 14, 16 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 2 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOs 4 or fragments, analogs or derivatives 
thereof. 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOi 10 or fragments, analogs or derivatives 
thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO* 14 or fragments, analogs or derivatives 
thereof . 



According to. one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOt 16 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOt 58 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOt 60 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO* 62 or fragments, analogs or derivatives 
thereof . 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO* 64 or fragments, analogs or derivatives 
5 thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
10 sequence SEQ ID NO: 67 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
15 least 70% identity to a second polypeptide comprising 

sequence SEQ ID NO: 68 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
20 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 69 or fragments, analogs or derivatives 
thereof . 

25 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOt 72 or fragments, analogs or derivatives 
thereof . 

3Q 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NOi 74 or fragments, analogs or derivatives 
35 thereof. 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO* 77 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or fragments, 
analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives thereof. 

According to . one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4 , 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 
to 79 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 75, 77 to 
79 or fragments, analogs or derivatives thereof. 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 10, 14, 16 or fragments, 
5 analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 2 or fragments, analogs or 
10 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 4 or fragments, analogs or 
15 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 10 or fragments, analogs or 
20 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 14 or fragments, analogs or 
25 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 16 or fragments, analogs or 
*0 derivatives thereof. 

According to one aspect, the present invention relates 'to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 10, 55 to 75, 77, 78, 79 or 
fragments, analogs or derivatives thereof. 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10/ 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

5 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

10 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

15 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 62, 64, 67, 68, 74, 77 or 
fragments, analogs or derivatives thereof. 

20 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 58 or fragments, analogs or 
derivatives thereof. 

25 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 62 or fragments, analogs or 
derivatives thereof. 

30 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 64 or fragments, analogs or 
derivatives thereof. 

35 

According to one aspect, the present invention relates to 

15 
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polypeptides characterized by the amino acid sequence 
comprising sequence SEQ id NO: 67 or fragments, analogs or 
derivatives thereof. 

5 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ id NO: 68 or fragments, analogs or 
derivatives thereof. 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 74 or fragments, analogs or 
derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 77 or fragments, analogs or 
derivatives thereof. 

in a further embodiment, the present invention also relates 
to chimeric polypeptides which comprise one or more 
polypeptides or fragments, analogs or derivatives thereof 
as described in the present application. 

In a further embodiment, the present invention also relates 
to chimeric polypeptides which comprise one or more 
polypeptides or fragments, analogs or derivatives thereof 
as defined in the figures of the present application. 

In a further embodiment, the present application also 
relates to chimeric polypeptides which comprise two or more 
polypeptides chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or' 
derivatives thereof ,-provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as to 
form a chimeric polypeptide. 
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In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs :10, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 or 

fragments, analogs or derivatives thereof; provided that 
the polypeptides or fragments, analogs or derivatives 
thereof are linked as to form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs :10, 58, 60, 62, 64, 67, 68, 74, 77 or fragments, 
analogs or derivatives thereof; provided that . the 
polypeptides or fragments, analogs or derivatives thereof 
are linked as to form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs .10, 62, 64, 67, 68, 74, 77 or fragments, analogs or 
derivatives thereof; provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as to 
form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 5 polypeptides. 

In a further embodiment , the chimeric polypeptide will 
comprise between 2 and 4 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 3 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
comprise 2 polypeptides. 
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In a further embodiment, there is provided a chimeric 
polypeptide of formula (I) : 
a-(B).-(C).-d (I) 

5 Wherein; 

m is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
10 thereof ; 

B is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof ; 

C is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 
15 75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof ; and 

D is chosen from SEQ ID NOs : 2, 4, 6, 8, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

20 

In a further embodiment, 

A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; 

25 B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
69, 72, 74, 77, or fragments, analogs or derivatives 
thereof; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof. 

In a further embodiment. 
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A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof; 
B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77, or fragments, analogs or derivatives thereof; 
C is chosen from SEQ ID NOs ilO, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof; and 
D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof. 

In one embodiment, chimeric polypeptides of the present 
invention comprise those wherein the following embodiments 
are present, either independently or in combination. 

In a further embodiment, A is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

in a further embodiment, A is SEQ ID NO slO or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO .;74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO i77 or fragments, 
analogs or derivatives thereof. 
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In a further embodiment, B is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, B is SEQ ID NO :10 or fragments, 
5 analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 
10 In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :68 or fragments, 
15 analogs or derivatives thereof . 

In a further embodiment, B is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO : 77 or fragments, 
analogs or derivatives thereof. 
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In a further embodiment, C is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, C is SEQ ID NO :10 or fragments, 
25 analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ id NO : 62 or fragments, 
analogs or derivatives thereof. 
30 In a further embodiment, C is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ id NO i 67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 68 or fragments, 
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analogs or derivatives thereof . 

In a further embodiment, C is SEQ ID NO : 74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 77 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, D is SEQ ID NO ilO or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :64 or fragments,, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :77 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, m is 0. 
In a further embodiment, n is 0. 

In a 'further embodiment, m and n are 0. 

In a further embodiment, m and n are 0, A is SEQ ID NO: 64 
or fragments, analogs or derivatives thereof, B is SEQ ID 
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NO: 62 or fragments, analogs or derivatives thereof. 
In a further embodiment, m and n are 0, A is SEQ ID NO: 62 
or fragments, analogs or derivatives thereof, B is SEQ ID 
NO: 64 or fragments, analogs or derivatives thereof. 

5 

In accordance with the present invention, all nucleotides 
encoding polypeptides and chimeric polypeptides are within 
the scope of the present invention. 

10 In a further embodiment, the polypeptides or chimeric 

polypeptides in accordance with the present invention are 
antigenic. 

In a further embodiment, the polypeptides or chimeric 
15 polypeptides in accordance with the present invention can 
elicit an immune response in an individual. 

In a further embodiment, the present invention also relates 
to polypeptides which are able to raise antibodies having 
20 binding specificity to the polypeptides or chimeric 

polypeptides of the present invention as defined above. 

An antibody that ■ has binding specificity" is an antibody 
that recognizes and binds the selected polypeptide but 
25 which does not substantially recognize and bind other 
molecules in a sample, e.g., a biological sample, which 
naturally includes the selected peptide. Specific binding 
can be measured using an ELISA assay in which the selected 
polypeptide is used as an antigen. 

30 

Unless otherwise defined, all technical and scientific 
terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which 
this invention belongs. All publications, patent 
35 applications, patents, and other references mentioned 

herein are incorporated by reference in their entirety. In 



22 



10 



15 



20 



25 



30 



35 



WO 00/39299 

PCT/CA99/01218 

case of conflict, the present specification, including 
definitions, will control. In addition, the materials, 
methods, and examples are illustrative only and not 
intended to be limiting. 

As used herein, "fragments", -derivatives" or "analogs" of 
the polypeptides of the invention include those 
polypeptides in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved 
amino acid residue (preferably conserved) and which may be 
natural or unnatural. In one embodiment, derivatives and 
analogs of polypeptides of the invention will have about 
70% identity with those sequences illustrated in the 
figures or fragments thereof. That is, 70% of the residues 
are the same. In a further embodiment, polypeptides will 
have greater than 75% homology, m a further embodiment, 
polypeptides will have greater than 80% homology. In a 
further embodiment, polypeptides will have greater than 85% 
homology. In a further embodiment, polypeptides will have 
greater than 90% homology, in a further embodiment, 
polypeptides will have greater than 95% homology, m a 
further embodiment, polypeptides will have greater than 99% 
homology, m a further embodiment, derivatives and analogs 
of polypeptides of the invention will have fewer than about 
20 amino acid residue substitutions, modifications or 
deletions and more preferably less than 10. Preferred 
substitutions are those known in the art as conserved i.e. 
the substituted residues share physical or chemical 
properties such as hydrophobic! ty, size, charge or 
functional groups. 

In accordance with the present invention, polypeptides of 
the invention include both polypeptides and chimeric 
polypeptides . 

Also included are polypeptides which have fused thereto 
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other compounds which alter the polypeptides biological or 
pharmacological properties i.e. polyethylene glycol (PEG) 
to increase half-life; leader or secretory amino acid 
sequences for ease of purification; prepro- and pro- 
sequences; and (poly) saccnarides . 

Furthermore, in those situations where amino acid regions 
are found to be polymorphic, it may be desirable to vary 
one or more particular amino acids to more effectively 
mimic the different epitopes of the different streptococcus 
strains . 



Moreover, the polypeptides of the present invention can be 
modified by terminal -NH a acylation (eg. by acetylation, or 
thioglycolic acid amidation, terminal carbosy amidation, 
e.g. with ammonia or methylamine) to provide stability, 
increased hydrophobicity for linking or binding to a 
support or other molecule. 

Also contemplated are hetero and homo polypeptide multimers 
of the polypeptide fragments, analogues and derivatives. 
These polymeric forms include, for example, one or more 
polypeptides that have been cross-linked with cross-linkers 
such as avidin/biotin, gluteraldehyde or dimethyl - 
superimidate. Such polymeric forms also include 
polypeptides containing two or more tandem or inverted 
contiguous sequences, produced from multicistronic mRNAs 
generated by recombinant DNA technology. 
Preferably, a fragment, analog or derivative of a 
polypeptide of the invention will comprise at least one 
antigenic region i.e. at least one epitope. 

i 

In order to achieve the formation of antigenic polymers 
(i.e. synthetic multimers), polypeptides may be utilized 
having. bishaloacetyl groups, nitroarylhalides, or the like, 
where the reagents being specific for thio groups. 
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Therefore, the link between two mercapto groups pf the 
different peptides may be a single bond or may be composed 
of a linking group of at least two, typically at least 
four, and not more than 16, but usually not more than about 
5 14 carbon atoms. 

In a particular embodiment, polypeptide fragments, analogs 
and derivatives of the invention do not contain a 
methionine (Met) starting residue. Preferably, 

.0 polypeptides will not incorporate a leader or secretory 
sequence (signal sequence). The signal portion of a 
polypeptide of the invention may be determined according to 
established molecular biological techniques. In general, 
the polypeptide of interest may be isolated from a 

5 streptococcus culture and subsequently sequenced to 

determine the initial residue of the mature protein and 
therefore the sequence of the mature polypeptide. 
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According to another aspect, there are provided vaccine 
compositions comprising one or more streptococcus 
polypeptides of the invention in admixture with a 
pharmaceutical^ acceptable carrier diluent or adjuvant . 
Suitable adjuvants include oils i.e. Freund's complete or 
incomplete adjuvant; salts i.e. AlK(S0 4 ) 2 , AlNa(SOJ s , 
AlNH 4 (SO t ) 2 , silica, kaolin, carbon polynucleotides i.e. 
poly IC and poly AU. Preferred adjuvants include QuilA and 
Alhydrogel. Vaccines of the invention may be administered 
parenterals by injection, rapid infusion, nasopharyngeal 
absorption, dermoabsorption, or bucal or oral. 

Pharmaceutical^ acceptable carriers also include tetanus 

toxoid. 

Vaccine compositions of the invention are used for the 
treatment or prophylaxis of streptococcus infection and/or 
diseases and symptoms mediated by streptococcus infection 
as described in P.R. Murray (Ed, in chief), E.J. Baron, M A 
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Pfaller, F.c. Tenover and r.h. Yolken. Manual of Clinical 
Microbiology, ASM Press, Washington, D.C. sixth edition, 
1995, 1482p which are herein incorporated by reference. In 
one embodiment, vaccine compositions of the present 
invention are used for the treatment or prophylaxis of 
meningitis, otitis media, bacteremia or pneumonia, m one 
embodiment, vaccine compositions of the invention are used 
for the treatment or prophylaxis of streptococcus infection 
and/or diseases and symptoms mediated by streptococcus 
infection, in particular S.pneumoni group A 
streptococcus (pyogenes) . group B streptococcus (GBS or 
agalactiae) , dysgalactiae, uteris, nocardia as well as 
Staphylococcus aureus. In a further embodiment, the 
streptococcus infection is S. pneumonia 

In a particular embodiment, vaccines are administered to 
those individuals at risk of streptococcus infection such 
as infants, elderly and immunocompromised individuals. 

As used in the present application, the term • individuals- 
include mammals. In a further embodiment, the mammal is 
human. 



Vaccine compositions are preferably in unit dosage form of 
> about 0.001 to 100 jig/kg (antigen/body weight) and more 
preferably 0.01 to 10 ng/kg and most preferably 0.1 to 1 
Hg/kg 1 to 3 times with an interval of about 1 to 6 week 
intervals between immunizations. 



According to another aspect, there are provided 
polynucleotides encoding polypeptides characterized by the 
ammo' acid sequence chosen from SEQ id NOs: 2, 4, 6, 8, 10 
14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 
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in one embodiment, polynucleotides are those illustrated in 
SEQ ID Nos, 1, 3, 5, 7, 9 , 11# 12 , 13 , 15# ?fi/ 8Q/ ^ 

which may include the open reading frames (ORF) , encoding 
polypeptides of the invention. it will be appreciated that 
the polynucleotide sequences illustrated in the figures may 
be altered with degenerate codons yet still encode the 
polypeptides of the invention. Accordingly the present 
invention further provides polynucleotides which hybridize 
to the polynucleotide sequences herein above described (or 
the complement sequences thereof) having 50% identity 
between sequences, in one embodiment, at least 70% identity 
between sequences. In one embodiment, at least 75% identity 
between sequences, in one embodiment, at least 80% identity 
between sequences. In one embodiment, at least 85% identity 
between sequences, m one embodiment, at least 90% identity 
between sequences, in a further embodiment, polynucleotides 
are hybridizable under stringent conditions i.e. having at 
least 95% identity, m a further embodiment, more than 97% 
identity 



In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 3 , 7, 9, 11, 12, 13, 15, 76, 
80, 82 encoding polypeptides of the invention. 

25 in a further embodiment, polynucleotides are those 

illustrated in SEQ ID NOs « 1, 3, 9, H, 12, 13, 15, 76, 
80, 82 which may include the open reading frames (ORF) 
encoding polypeptides of the invention. 

in a further embodiment, polynucleotides are those 
illustrated in SEQ id NOs , 1, 3, 9, 11, 12, 13, 15, 76 

which' may include the open reading frames (ORF), encoding 
polypeptides of the invention. 



35 In a further embodiment, polynucleotides 



are those 
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illustrated in SEQ ID NOs : 1, 3, 7, 9, H, 12, 13, 15, 76 

which may include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 7, 9, 11, 15, 76 which may 
include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

In a .further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 9, 11, 15, 76 which may 
include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 7, 9, 11 which may include 

the open reading frames (ORF) , encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ id NO : 1, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :7, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO *9, encoding polypeptides of the 
invention. 

1 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO ill, encoding polypeptides of the 
invention. . 
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In a further embodiment, polynucleotides are those 
illustrated in SEQ id NO :15, encoding polypeptides of the 
invention. 



In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 3, 12, 13, 76, encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ id NO «3, encoding polypeptides of the 
invention . 



In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :12, encoding polypeptides of the 
invention . 



In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO H3, encoding polypeptides of the 
invention . 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO i76, encoding polypeptides of the 
invention. 



As will be readily appreciated by one skilled in the art, 
polynucleotides include both DNA and RNA. 

The present invention also includes polynucleotides 
complementary to the polynucleotides described in the 
present application. 

In a further aspect, polynucleotides encoding polypeptides 
of the invention, or fragments, analogs or derivatives 
thereof, may be used in a DNA immunization method. That 
is, they can be incorporated into a vector which is 
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replicable and expressible upon injection thereby producing 
the antigenic polypeptide in vivo. For example 
polynucleotides may be incorporated into a plasmid vector 
under the control of the CMV promoter which is functional 
in eukaryotic cells. Preferably the vector is injected 
intramuscularly . 

According to another aspect, there is provided a process 
for producing polypeptides of the invention by recombinant 
techniques by expressing a polynucleotide encoding said 
polypeptide in a host cell and recovering the expressed 
polypeptide product. Alternatively, the polypeptides can 
be produced according to established synthetic chemical 
techniques i.e. solution phase or solid phase synthesis of 
oligopeptides which are ligated to produce the full 
polypeptide (block ligation) . 

General methods for obtention and evaluation of 
polynucleotides and polypeptides are described in the 
following references: Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y., 1989; 
Current Protocols in Molecular Biology, Edited by Ausubel' 
F.M. et al., John Wiley and Sons, Inc. New York; PCR 
Cloning Protocols, from Molecular Cloning to Genetic 
Engineering, Edited by White B.A., Humana Press, Totowa, 
New Jersey, 1997, 490 pages; Protein Purification, 
Principles and Practices,. Scopes R.K. , Springer-Verlag, New 
York, 3rd Edition, 1993, 380 pages; Current Protocols in 
Immunology, Edited by Coligan J.E. et al . , John Wiley & 
Sons Inc . , New York which are herein incorporated by 
reference. 

i 

For recombinant production, host cells are transfected with 
vectors which encode the polypeptide, and then cultured in 
a nutrient media modified as appropriate for. activating 
promoters, selecting transformants or amplifying the genes. 
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Suitable vectors are those that are viable and replicable 
in the chosen host and include chromosomal, non-chromosomal 
and synthetic DNA sequences e.g. bacterial plasmids, phage 
DNA, baculovirus, yeast plasmids, vectors derived from 
combinations of plasmids and phage DNA. The polypeptide 
sequence may be incorporated in the vector at the 
appropriate site using restriction enzymes such that it is 
operably linked to an expression control region comprising 
a promoter, ribosome binding site (consensus region or 
Shine-Dalgarno sequence) , and optionally an operator 
(control element) . One can select individual components of 
the expression control region that are appropriate for a 
given host and vector according to established molecular 
biology principles (Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed. Cold Spring Harbor, N.Y., 1989; 
Current Protocols in Molecular Biology, Edited by Ausubel 
F.M. et al., John Wiley and Sons, Inc. New York 
incorporated herein by reference) . Suitable promoters 
include but are not limited to LTR or SV40 promoter, E.coli 
lac, tac or trp promoters and the phage lambda P t promoter. 
Vectors will preferably incorporate an origin of 
replication as well as selection markers i.e. ampicilin 
resistance gene. Suitable bacterial vectors include pET, 
PQE70, pQE60, pQE-9, pbs, pDIO phagescript, psiX174, 
pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A, 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 and eukaryotic 
vectors pBlueBacIII, pWLNEO, pSV2CAT, pOG44, pXTl, pSG, 
PSVK3, pBPV, pMSG and pSVL. Host cells may be bacterial 
i.e. E.coli , Bacillus subtilis. streptomvces ; fungal i.e. 
Aspergillus niger, Aspergillus nidulins : yeast i.e. 
SaccharoTnypiag or eukaryotic i.e. CHO, COS. 

Upon expression of the polypeptide in culture, cells are 
typically harvested by centrifugation then disrupted by 
physical or chemical means (if the expressed polypeptide is 
not secreted into the media) and the resulting crude 



31 



WO 00/39299 



PCT/CA99/01218 



.extract retained to isolate the polypeptide of interest. 
Purification of the polypeptide from culture media or 
lysate may be achieved by established techniques depending 
on the properties of the polypeptide i.e. using ammonium 
5 sulfate or ethanol precipitation , acid extraction, anion 
or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
hydroxylapatite chromatography and lectin chromatography. 
Final purification may be achieved using HPLC. 

10 

The polypeptide may be expressed with or without a leader 
or secretion sequence. In the former case the leader may 
be removed using post-translational processing (see US 
4,431,739; US 4,425,437; and US 4,338,397 incorporated 
15 herein by reference) or be chemically removed subsequent to 
purifying the expressed polypeptide. 

According to a further aspect, the streptococcus 
polypeptides of the invention may be used in a diagnostic 
20 test for streptococcus infection, in particular S. 

pneumoniae infection. Several diagnostic methods are 
possible, for example detecting streptococcus organism in a 
biological sample, the following procedure may be followed: 
a) obtaining a biological sample from a patient; 
25 b) incubating an antibody or fragment thereof reactive 

with a streptococcus polypeptide of the invention with 
the biological sample to form a mixture; and 
c) detecting specifically bound antibody or bound 

fragment in the mixture which indicates the presence 
30 of streptococcus . 

Alternatively, a method for the detection of antibody 
specific to a streptococcus antigen in a biological sample 
containing or suspected of containing said antibody may be 
35 performed as follows: 

a) obtaining a biological sample from a patient; 
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b) incubating one or more streptococcus polypeptides of 

the invention or fragments thereof with the biological 

sample to form a mixture; and 
O detecting specifically bound antigen or bound fragment 

in the mixture which indicates the presence of 

antibody specific to streptococcus. 

One of skill in the art will recognize that this diagnostic 
test may take several forms, including an immunological 
test such as an enzyme-linked immunosorbent assay (ELISA) , 
a radioimmunoassay or a latex agglutination assay, 
essentially to determine whether antibodies specific for 
the protein are present in an organism. 



The DNA sequences encoding polypeptides of the invention 
may also be used to design DNA probes for use in detecting 
the presence of streptococcus in a biological sample 
suspected of containing such bacteria. The detection 
method of this invention comprises: 
20 a) obtaining the biological sample from a patient; 
b) incubating one or more DNA probes having a DNA 

sequence encoding a polypeptide of the invention or 
fragments thereof with the biological sample to form a 
mixture; and 

25 c) detecting specifically bound DNA probe in the mixture 
which indicates the presence of streptococcus 
bacteria. 



The DNA probes of this invention may also be used for 
detecting circulating streptococcus i.e. 

S.pneumoniaen ucleic acids in a sample, for example using a 
polymerase chain reaction, as a method of diagnosing 
streptococcus infections. The probe may be synthesized 
using conventional techniques and may be immobilized on a 
solid phase, or may be labelled with a detectable label. ; 
preferred DNA probe for this application is an oligomer 
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having a sequence complementary to at least about 6 
contiguous nucleotides of the streptococcus pneumoniae 
polypeptides of the invention. 

5 Another diagnostic method for the detection of 
streptococcus in a patient comprises: 

a) labelling an antibody reactive with a polypeptide of 
the invention or fragment thereof with a detectable 
label; 

10 b) administering the labelled antibody or labelled 
fragment to the patient; and 
c) detecting specifically bound labelled antibody or 

labelled fragment in the patient which indicates the 
presence of streptococcus. 

15 

A further aspect of the invention is the use of the 
streptococcus polypeptides of the invention as immunogens 
for the production of specific antibodies for the diagnosis 
and in particular the treatment of streptococcus infection. 
20 Suitable antibodies may be determined using appropriate 

screening methods, for example by measuring the ability of 
a particular antibody to passively protect against 
streptococcus infection in a test model. One example of an 
animal model is the mouse model described in the examples 
25 herein. The antibody may be a whole antibody or an 

antigen-binding fragment thereof and may belong to any 
immunoglobulin class. The antibody or fragment may be of 
animal origin, specifically of mammalian origin and more 
specifically of murine, rat or human origin. It may be a 
natural antibody or a fragment thereof, or if desired, a 
recombinant antibody or antibody fragment. The term 
recombinant antibody or antibody fragment means antibody or 
antibody fragment which was produced using molecular 
biology techniques. The antibody or antibody fragments may 
be polyclonal, or preferably monoclonal. It may be. 
specific for a number of epitopes associated with the 
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streptococcus pneumoniae polypeptides but is preferably 
specific for one. 

Without limiting its scope, the present invention also 
relates to new antigens designated BVH-3, BVH-11, BVH-11-2, 
BVH-28 and BVH-71. The present invention also relates to 
truncated polypeptides comprising fragments of the new 
antigens designated BVH-3, BVH-11, BVH-11-2, BVH-28 and 
BVH-71. The present invention also relates to chimeric 
polypeptides comprising fragments of the new antigens 
designated BVH-3, BVH-11, BVH-11-2, BVH-28 and BVH-71. The 
following is a reference table summarizing the relation 
between the antigens of the present invention: 



Family 


Nucleotide SEQ id 
NO 


Polypeptide SEQ ID 
NO 


BVH-3 — ■ 


BVH-3 


1. 11 


2 


BVH-3 A 


7 


8 


BVH-3B 


9 


10 


BVH-3 SP63 


15 


16 


BVH-3M 




55 


BVH-3 AD 




56 


L-BVH-3AD 




57 


Newl2 


76 


58 


BVH-3C 




59 


Newl 




64 


New2 




65 


New3 




66 


Newl 5 




78 


BVH-11 




BVH-11 


3, 12 


4 


BVH-11-2 


13 


14 


BVH-11M 




60 


BVH-11A 




61 


BVH-11B also 
referred to as 
NEW13 




62 


BVH-11C 




63 


New4 




67 


New5 




68 
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Family 


Nucleotide SEQ id 
NO 


Polypeptide SEO ID 
NO 


New 6 




69 


New7 




70 


New8 




71 


New9 




72 


BVH-11-2M 




73 


NewlO 




74 


Newll 




75 


Newl2 


76 


58 


Newl4 




77 


Newl6 




79 


BVH-28 ■ 


BVH-28 


5 


6 


BVH-71 


GBS 


80 


81 


GAS 


82 


83 



EXAMPLE 1 



This example illustrates the cloning of S^ pneumoniae genes. 

The coding region of S_^ pneumoniae gene BVH-3 (SEQ ID NO: 1) 
and the coding region of S^ pneumoniae gene BVH-28 (SEQ ID 
NO: 5) were amplified by PCR (DNA Thermal Cycler GeneAmp 
PCR system 2400 Perkin Elmer, San Jose, CA) from genomic DNA 
of serogroup 6 S_^ pneumoniae strain SP64 using the oligos 
that contained base extensions for the addition of 
restriction sites Bglll (AGATCT) and Xbal (TCTAGA) . PCR 
products were purified from agarose gel using a QIAquick gel 
extraction kit from QIAgen (Chatsworth, CA) , digested Bglll- 
Xbal (Pharmacia Canada Inc, Baie d'Urfe, Canada), extracted 
with phenol : chloroform and precipitated with ethanol. The 
Superlinker vector pSL301 (Invitrogen, San Diego, CA) was 
digested with Bglll and Xbal and purified from agarose gel 
using a QIAquick gel extraction kit from QIAgen (Chatsworth, 
CA) . The Bglll-Xbal genomic DNA fragments were ligated to 
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the Bglll-Xbal pSL301 vector. The ligated products were 
transformed into coli strain DH5a [f80 lacZ DM15 endAl 
reckl hsdRll ( r K- m K + ) supE44 thi-lV gyrA96 relAl D ( lacZYA- 
argrF)U169] (Gibco BRL, Gaithersburg, MD) according to the 
method of Simanis (Hanahan, D. DNA Cloning, 1985, D.M. 
Glover (ed) , pp. 109-135). Recombinant pSL3 01 plasmids 
(rpSL301) containing either BVH-3 or BVH-28 gene were 
purified using a QIAgen kit (Chatsworth, CA) and DNA inserts 
were confirmed by nucleotide sequence analysis (Taq Dye 
Deoxy Terminator Cycle Sequencing kit, ABI, Foster City, 
CA) . Recombinant rpSL301 (rpSL301) were digested with the 
restriction enzymes Bglll (AGATCT) and Xhol (CTCGAG) . DNA 
fragments Bglll-Xhol were purified using the QlAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . pET-32c(+) 
expression vector (Novagen, Madison, WI) containing the 
thioredoxin-His-Tag sequence was digested with BamHI 
(GGATCC) and Xhol and gel extracted using the QlAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . The Bglll-xhol 
DNA fragments were ligated to the BamHI-XhoI pET-32c(+) 
vector to create the coding sequence for thioredoxin- 
Hi s • Tag-BVH- 3 or thioredoxin-His • Tag-BVH-28 fusion protein. 
The ligated products were transformed into E_;_ coli strain 
DH5a tf80 lacZ DM15 endAl recAl hsdR17 ( r K _m K + ) supE44 thi-11" 
gyrA96 relAl D(lacZYA-argF)U169] (Gibco BRL, Gaithersburg, 
MD) according to the method of Simanis (Hanahan, D. DNA 
Cloning, 1985, D.M. Glover (ed) , pp. 109-135). Recombinant 
pET-32c(+) plasmids were purified using a QIAgen kit 
(Chatsworth, CA) and the nucleotide sequences at the fusion 

i 

sites of thioredoxin-His-Tag and DNA insert were verified by 
DNA sequencing (Taq Dye Deoxy Terminator Cycle Sequencing 
kit, ABI, Foster City, CA) . 
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EXAMPLE 2 



10 



15 



25 



30 



This example illustrates the cloning of pneumoniae 
5 protein genes in CMV plasmid pCMV-GH. 

The DNA coding region of a pneumoniae protein was 
inserted in phase downstream of a human growth hormone (hGH) 
gene which was under the transcriptional control of the 
cytomegalavirus (CMV) promoter in the plasmid vector pCMV-GH 
(Tang et al.. Nature, 1992, 356 :152). The CMV promoter is 
non functional plasmid in E. coli cells but active upon 
administration of the plasmid in eukaryotic cells. The 
vector also incorporated the ampicillin resistance gene. 

The coding region of BVH-3 gene (SEQ ID NO* 1) and BVH-28 
gene (SEQ id no. 5) were obtained from rpSL301 (see example 
1) using restriction enzymes Bglll (AGATCT) and Xbal 
(TCTAGA) . The digested products were purified from agarose 
gel using the QIAquick gel extraction kit from QIAgen 
(Chatsworth, CA) . The pCMV-GH vector (Laboratory of Dr. 
Stephen A. Johnston, Department of Biochemistry, The 
University of Texas, Dallas, Texas) containing the human 
growth hormone to create fusion proteins was digested with 
Bglll and Xbal and purified from agarose gel using the 
QIAquick gel extraction kit from QIAgen (Chatsworth, CA) . 
The Bglli-xbal DNA fragments were ligated to the Bglii-xbal 
PCMV-GH vector to create the hGH- BVH-3 or hGH-BVH-28 fusion 
protein under the control of the CMV promoter. The ligated 
products were transformed into coli strain DH5arf80 lacZ 
DM15 endAl recAl hsdRll <*k-**) supBU thi-lV gyrKSS relKl 
D(lacZYA-ar ff F)U169] (Gibco BRL, Gaithersburg, MD) according 
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to the method of Simanis (Hanahan, D. DNA Cloning, 1985, 
D.M. Glover (ed) , pp. 109-135). The recombinant pCMV 
plasmids were purified using a QIAgen kit (QIAgen, 
Chatsworth, CA) . 



The coding region of BVH-11 gene (SEQ ID NO: 3) was 
amplified by PCR (DNA Thermal Cycler GeneAmp PCR system 2400 
Perkin Elmer, San Jose, CA) from genomic DNA of serogroup 6 

pneumoniae strain SP64 using the oligos that contained 
base extensions for the addition of restriction sites Bglli 
(AGATCT) and Hindlll (AAGCTT) . The PCR product was purified 
from agarose gel using a QIAquick gel extraction kit from 
QIAgen (Chatsworth, CA) , digested with restriction enzymes 
(Pharmacia Canada Inc, Baie d'Urfe, Canada), extracted with 
phenol : chloroform and precipitated with ethanol. The 
pCMV-GH vector (Laboratory of Dr. Stephen A. Johnston, 
Department of Biochemistry, The University of Texas, Dallas, 
Texas) was digested with Bglli and Hindlll and purified from 
agarose gel using the QIAquick gel extraction kit from 
QIAgen (Chatsworth, CA) . The Bglll-HindHI DNA fragment was 
ligated to the Bglll-HindHI pCMV-GH vector to create the 
hGH-BVH-11 fusion protein under the control of the CMV 
promoter. The ligated products were transformed into coli 
strain DH5a[f80 lacZ DM15 endAl recAl iisdR17 ( r K _m K + ) SU pE44 
thi-ir gyrA96 relAl D (lacZYA-argF) U169] (Gibco BRL, 
Gaithersburg, MD) according to the method of Simanis 
(Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 109- 
135) The recombinant pCMV plasmid was purified using a 
QIAgen kit (Chatsworth, CA) and the nucleotide sequence of 
the DNA insert was verified by DNA sequencing. 
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EXAMPLE 3 

This example illustrates the use of DNA to elicit an immune 
response to pneumoniae antigens. 

A group of 8 female BALB/c mice (Charles River, St-Constant, 
Quebec, Canada) were immunized by intramuscular injection of 
50 m three times at two- or three-week intervals with 100 
WT of recombinant pCMV-GH encoding the BVH-3, BVH-11 or the 
BVH-28 gene in presence of 50 ug of granulocyte-macrophage 
colony-stimulating factor (GM-CSF) - expressing plasmid pCMV- 
GH-GM-CSF (Laboratory of Dr. Stephen A. Johnston, Department 
of Biochemistry, The University of Texas, Dallas, Texas) . 
As control, a group of mice were injected with 100 ug of 
15 pCMV-GH in presence of 50 ug of pCMV-GH-GM-CSF. Blood 
samples were collected from the orbital prior to each 
immunization and seven days following the third injection 
and serum antibody responses were determined by ELISA using 
thioredoxin-His-Tag-S^ pneumoniae fusion protein as coating 
20 antigen. DNA immunization with recombinant plasmid pCMV-GH 
encoding the BVH-3, BVH-11 or the BVH-28 S_, pneumoniae 
protein induced antibody reactive against the respective 
recombinant protein. The reciprocal antibody titers, defined 
as the highest serum dilution at which the absorbance values 
25 were 0.1 above the background values, were above 4xl0 3 . 



EXAMPLE 4 



30 



This example illustrates the production and purification of 
recombinant S^ pneumoniae proteins. 
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The recombinant pET plasmids containing the BVH-3, BVH-11 or 
the BVH-28 gene corresponding to the SEQ ID NO: 1 , SEQ ID 
NO: 3 or the SEQ ID NO: 5 respectively were transformed by 
electroporation (Gene Pulser II apparatus, BIO-RAD Labs, 
Mississauga, Canada) into coli strain AD494 (DE3) (Dara" 
2eu7697 DIacX74 DphoA PvuII phoR BmalF3 F' [lac* (lacl*) pro] 
fcrxB::Kan) (Novagen, Madison, WI) . In this strain of 
coli, the T7 promotor controlling expression of the fusion 
protein is specifically recognized by the T7 RNA polymerase 
(present on the 1DE3 prophage) whose gene is under the 
control of the lac promotor which is inducible by isopropyl- 
fi-d-thio-galactopyranoside (IPTG) . The transformant 
AD494(DE3)/rpET was grown at 37°C with agitation at 250 rpm 
in LB broth (peptone 10g/L, yeast extract 5g/L, NaCl lOg/L) 
containing lOOug of ampicillin ( Sigma -Aldrich Canada Ltd., 
Oakville, Canada) per ml until the Agoo reached a value of 
0.6. In order to induce the production of the thioredoxin- 
His • Tag- BVH-3 , thioredoxin-His-Tag-BVH-11 or thioredoxin- 
His-Tag-BVH-28 fusion protein, the cells were incubated for 
2 additional hours in the presence of IPTG at a final 
concentration of 1 mM. Induced cells from a 100 ml culture 
were pelleted by centrifugation and frozen at -70°C. 

The purification of the fusion proteins from the soluble 
cytoplasmic fraction of IPTG- induced AD494 (DE3) /rpET was 
done by affinity chromatography based on the properties of 
the His -Tag sequence (6 consecutive histidine residues) to 
bind to divalent cations (Ni 2+ ) immobilized on the His -Bind 
metal chelation resin. Briefly, the pelleted cells obtained 
from a lOOmL culture induced with IPTG were resuspended in 
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Phosphate-buffered (PBS) :500iriM NaCl pH7.1, sonicated and 
spun at 20,000 X g for 20 min to remove debris. The 
supernatant was filtered (0.22um pore size membrane) and 
deposited on a HiTrap® ImL chelating pre-packed ready-to-use 
column (Pharmacia Biotech, Baie d'Urfe, Canada) . The 
thioredoxin-His-Tag-S^ pneumoniae fusion protein was eluted 
with 1M imidazole-500mM NaCl-PBS pH7.1. The removal of the 
salt and imidazole from the sample was done by dialysis 
against PBS at 4°C. The quantities of fusion protein 
obtained from the soluble fraction of E. coli was estimated 
by MicroBCA (Pierce, Rockford, Illinois) . 



EXAMPLE 5 



15 



20 



25 



30 



This example illustrates the protection of mice against 
fatal pneumococcal infection by immunization. 

Groups of 8 female BALB/c mice (Charles River) were 
immunized subcutaneous ly three times at three-week intervals 
with either 25 ng of affinity purified thioredoxin-His-Tag- 
BVH-3 fusion protein in presence of 15 jig of QuilA adjuvant 
(Cedarlane Laboratories Ltd, Hornby, Canada) or, as control, 
with QuilA adjuvant alone in PBS. Blood samples were 
collected from the orbital sinus on day 1, 22 and 43 prior 
to each immunization and seven days (day 50) following the 
third injection. One week later the mice were challenged 
with approximately 10 6 CFU of the type 3 S^ pneumoniae 
strain WU2. Samples of the S^. pneumoniae challenge inoculum 
were plated on chocolate agar plates to determine the CFU 
and to verify the challenge dose. Deaths were recorded for 
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a period of 14 days and on day 14 post-challenge, the 
surviving mice were sacrificed and blood samples tested for 
the presence of pneumoniae organisms. The survival data 
are shown in table 1. 



10 



15 



Prechallenge sera were analyzed for the presence of 
antibodies reactive with pneumoniae by standard 
immunoassays. Elisa and immunoblot analyses indicated that 
immunization with recombinant S, pneumoniae protein produced 
in cpli elicited antibodies reactive with both, 
recombinant and native pneumococcal protein. 

Table 1. Protection mediated by recombinant BVH-3 protein 



Intmunogen 


No. of mice alive : no. of mice 
dead 

14 days post -challenge 


Median day of 
death 


BVH-3 


8 : 0 ■ 


>14 


none 


0 : 8 " 


1 



20 



All mice immunized with BVH-3 recombinant protein survived 
to infection while none of the control mice given adjuvant 
alone survived. There was a significant difference in 
survival between the two groups of mice (P<0.0001, log rank 
test for nonparametric analysis of survival curves; 
P=0.0002, Fisher's exact test). All hemocultures from 
surviving mice were negative at day 14 post-challenge. 
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This example describes the cloning of BVH-3 and BVH-11 
genes from a variety of pneumoniae strains and the 
molecular conservation of these genes. 

Molecular analysis of chromosomal DNA from various 
pneumoniae isolates with DNA probes spanning different 
regions of BVHzl or BVH-11 revealed the presence of one 
BVH^l gene copy and two BVH-11 gene copies. The two BVH-11 
gene copies are not identical and the genes were 
arbitrarily designated BVH-11 (SEQ ID NO: 12; ORF at 
nucleotides 45 to 2567) and BVH-11-2 (SEQ ID NO: 13; ORF at 
nucleotides 114 to 2630) . 

The first amino acids of the BVH-3 and BVH-11 coding 
regions have the characteristics of leader sequences also 
known as signal peptides. The consensus signal peptidase 
cleavage site L-X-X-C of lipoprotein 

modification/processing sites was present in the sequences 
Mature BVH-3, BVH-11 and BVH-11-2 proteins from £L_ 
pneumoniae SP64 have 1019, 821 and 819 amino acids, 
respectively. The regions of S_ pneumonia genes coding 
for mature BVH-3, termed BVH-3M, (nucleotides 1837 - 4896; 
SEQ. ID. NO: 11), BVH-llM (nucleotides 102-2567; SEQ. ID.' 
NO: 12) and BVH-11-2M (nucleotides 171-2630; SEQ. ID. NO:' 
13), were amplified by PCR(DNA Thermal Cycler GeneAmp PCR 
system 2400 Perkin Elmer, San Jose, CA) from genomic DNA of 
6 ° r7L pneumoniae strains. Serogroup 6 pneumoniae 
SP64 and serogroup 9 SP63 clinical isolates were provided 
by the laboratoire de la sante publique du Quebec, Sainte- 
Anne-de-Bellevue; serotype 4 strain JNR.7/87 was provided 
by Andrew Camilli, Tufts University School of Medicine, 
Boston; Rxl strain, a nonencapsulated derivative of the 
type 2 strain D39 and the type 3 strains A66 and WU2 were 
provided by David E. Briles from University of Alabama, 
Birmingham and the type 3 clinical isolate P4241 was 
provided by the centre de recherche en infectiologie du 
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centre hospitaller de 1' university Laval, Sainte-Foy. The 
sets of oligonucleotide primers OCRR479-OCRR480; HAMJ160- 
OCRR488 and HAMJ160-HAMJ186, that contained base extensions 
for the addition of restriction sites were used for the 
amplification of BVH- 3, BVH- 11 and BVH- 11- 2 gene, 
respectively, with the exception of BVH- 11 gene from SP64 
strain which was amplified using the set of primers 
consisting of HAMJ487 and OCRR488. Primer sequences are 
listed below (Table 2). PC R products were purified from 
agarose gel using a QIAquick gel extraction kit from QiAgen 
(Chatsworth, CA) and digested Bglli-xbal or Bglll-Hindlll 
(Pharmacia Canada Inc. Baie d'Urfe, Canada). Digestions 
were cleaned using a QIAquick PCR purification kit from 
QIAgen (Chatsworth, CA) . The PCR products were ligated to 
the Bglli-xbal or Bglll-Hindlll pSL301 vector. The ligated 
products were transformed into E_j_ coli strain DH5a £080 
lacZ AM15 endAl recAl hsdR17 ('yTk*) supE44 thi-lV gyrA96 
relAl A(I ac ZYA-arsrF)U169] (Gibco BRL, Gaithersburg, MD) 
according to the method of Simanis (Hanahan, D. DNA 
Cloning, 1985, D.M. Glover (ed) , pp. 109-135). Recombinant 
PSL301 plasmids (rpSL301) containing BVH-3 . BVH- 11 or 
BVH11 " 2 were Purified using a QIAgen kit (Chatsworth, CA) 
and DNA inserts were sequenced (Taq Dye Deoxy Terminator 
Cycle Sequencing kit, ABI, Foster City, CA) . The figures 11 
and 12 depict the consensus sequence established from the 
BVH ~ 3 ' 311(1 BVH- 11 deduced amino acid sequences, 
respectively. Comparison of BVH-3 protein sequences 
revealed 99 to 100% identity of sequences for all strains 
with the exception that BVH-3 from serogroup 9 SP63 strain 
30 (SEQ. ID. NO: 15 and SEQ. ID. NO: 16) misses a stretch of 
177 amino acids corresponding to residues 244 to 42C on 
BVH-3 'protein sequence of pneumoniae S P64. Analysis of 
sequences of additional serogroup 9 strains revealed BVH-3 
molecule having the same deletion in 3 out of 4 strains 
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thus suggesting that the 3 strains are members of a S. 
pneumoniae serogroup 9 clone. 

Comparison of 13 BVH-11 nucleotide sequences obtained from 
7 £L_ pneumoniae strains, revealed that the nucleotide 
sequences are very similar. Computer analysis {MacVector, 
Clustal w 1.4) using multiple alignment of the predicted 
BVH-11 protein sequences revealed that these sequences were 
75% identical and 82 % homologous on a length of 834 amino 
acids. Pairwise alignment revealed 80 to 100% identity 
(Figure 13) . The sequences showed great similarity in 
overall organization. Variability in the primary sequence 
of these proteins is almost restricted to the last 125 
amino acids in the C- terminal portion of the proteins. This 
region constitutes a domain. Close examination of this 
domain revealed two groups of sequences. The first 9 
sequences from the figure 13 belong to one group while the 
last 4 sequences belong to another group. A 39% identity 
value is obtained when the domain sequences of the 13 
proteins are compared (MacVector, Clustal W 1.4). The 
identity value increased to more than 92% when sequences 
belonging to a same group are compared. 



EXAMPLE 7 



This example illustrates the homology of portions of BVH-3 
and BVH-11 genes. 

Molecular analysis with DNA probes derived from BVH-3 and 
M2zll genes indicated that BVH-3 and BVH-11 were related. 
In dot blot hybridization studies, DNA probe consisting of 
either, BVH-3 or BVH-11, gene sequence hybridized to both, 
BVH ~ 3 and BVH-11 genes thus indicating that BVH-3 and BVH- 
11 genes shared homologous sequences. Comparison of 
sequences revealed that the ORFs and the proteins were 43 
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and 33% identical, respectively. Closer examination 
revealed that the region corresponding to amino acids 1 to 
225 in BVH-3 and 1 to 228 in BVH-11 were 73 and 75% 
identical at the DNA and protein level, respectively. In 
contrast, the 3' regions corresponding to amino acids 226 
to 1039 from BVH-3 and amino acids 229-840 from BVH-11 were 
only 34 and 22% identical at the DNA and protein level, 
respectively. Thus the 5' termini of BVH-3 and BVH-11 
genes appear to contain highly conserved sequences while 
the remaining parts of the genes are highly divergent. 
These results suggest that BVH-3 and BVH-11 might share 
similar functions mediated by sequences present in the 
conserved region whereas BVH-3- and BVH-ll-specif ic 
functions might be mediated by sequences in the divergent 
15 region. 
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This example describes the cloning of truncated BVH-3 . BVH- 
11 and BVH-11-2 genes by polymerase chain reaction (PCR) 
and the expression of truncated BVH-3 and BVH-11 molecules. 

Gene fragments were amplified by PCR using pairs of 
oligonucleotide engineered to amplify fragments spanning 
the BVH-3 (SEQ ID NO: 1 and SEQ ID NO: 11) , BVH-11 (SEQ ID 
NO: 3 and SEQ ID NO: 12) or BVH-11-2 (SEQ. ID NO: 13) gene 
from pneumoniae strain SP64. Each of the primers had a 
restriction endonuclease site at the 5' end, thereby 
allowing directional in-frame cloning of the amplified 
product into the digested plasmid vector (Tables 2 and 3). 
PCR-amplified products were digested with restriction 
endonucleases and ligated to either linearized plasmid 
PSL301 (see example 1), pCMV-GH (see example 2) or pET 
35 (Novagen, Madison, WI) expression vector digested likewise 
or digested with enzymes that produce compatible cohesive 

47 
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ends. Recombinant pSL301 and recombinant pCMV-GH. plasmids 
were digested with restriction enzymes for the in-frame 
cloning in pET expression vector. Clones were first 
stabilized in E^ coli DH5a before introduction into coli 
5 BL2KXDE3) or AD494 (XDE3) for expression of truncated BVH- 
3 or BVH-11 molecules. Each of the resultant plasmid 
constructs was confirmed by nucleotide sequence analysis. 
The recombinant proteins were expressed as N-terminal 
fusions with the thioredoxin and His-tag or as C-terminal 

10 fusions with an His-tag. The expressed recombinant 

proteins were purified from supernatant fractions obtained 
from centrifugation of sonicated IPTG- induced E^. coli 
cultures using a His-Bind metal chelation resin (QlAgen, 
Chatsworth, CA) . The gene products generated are listed in 

15 the table 3. The gene products corresponding to the N- 
terminal region including the signal sequence are 
designated as Lipidated-proteins or lipoproteins (L- 
proteins) . The gene products corresponding to the N- 
terminal region lacking the signal sequence are identified 

20 as protein without signal sequence (w/o ss) . 



Table 2. List of PCR oligonucleotide primers 



Primer 


SEQ. 
ID. 


Sequence 5' - 3' 


Nucleotide 
position 


Restric- 
tion 
sites 


OCRR479 


17 


cagtagatctgtgcctatgcactaaac 


SEQ ID 1:61- 
78 


Bgin 


COIR 480 


18 


gatctctagactactgctattccttacgctatg 


SEQ 

ID 11 :4909- 
4887 


Xbal 


OCRR497 

I 


19 


atcactcgagcattacctggataatcctgt 


SEQ 

ID 1 :1525- 
1506 


Xhol 


OCRR498 


20 


ctgctaagcttatgaaagatttagat 


SEQ 

ID 1 :1534- 
1548 


Hindm 
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OCRR499 


21 


eatactceasct&ctattccttac 


pen 

ID 11 :4906- 
4893 


Xhol 


HAMJ 172 


22 


gaatctcgagttaagctgctgctaattc 


SEQ ID 1 . 
675-661 


Xhol 


HAMJ247 


23 


gacgctcgagcgctatgaaatcagataaattc 


SEQ ID 

1 :3 117-3096 


Xhol 


HAMJ 248 


24 


gacgctcgagggcattacctggataatcctgttcatg 


SEQ ID 

1 : 1527-1501 


Xhol 


HAMJ 249 


25 


cagtagatctcttcatcatttattgaaaagagg 


SEQ ID 11 : 
1749-1771 


BglH 


HAMJ 278 


26 


ttatttcttccatatggacttgacagaagagcaaattaag 


SEQ ID 

1 : 1414-1437 


Ndel 


HAMJ 279 


27 


cgccaagcttcgctatgaaatcagataaattc 


SEQ ID 

1 :3 117-3096 


HindHI 


HAMJ 280 


28 


cgccaagcttttccacaatataagtcgattgatt 


SEQ ID 

1 :2400-2377 


Hindm 


HAMJ 281 


29 


ttatttcttccatatggaagtacctatcttggaaaaagaa 


SEQ ID 

1 :2398-2421 


Ndel 


HAMJ 300 


30 


ttatttcttccatatggtgcctatgcactaaaccagc 


SEQ ID 1 :62- 
82 


Ndel 


HAMJ 313 


31 


ataagaatgcggccgcttccacaatataagtcgattgatt 


SEQJD 

1 :2400-2377 


NotI 


OCRR487 


32 


cagtagatctgtgcttatgaactaggtttgc 


SEQ ID 3 :58- 
79 


BglH 


OCRR 488 


33 


gatcaagcttgctgctacctttacttactctc 


SEQ ID 

12 :2577-2556 


HindJH 


HAMJ 171 


34 


ctgagatatccgttatcgttcaaacc 


SEQ ID 

3 : 1060-1075 


EcoRV 


HAMJ 251 


35 


ctgcaagcttttaaaggggaataatacg 


SEQ ID 

3 : 1059-1045 


HindJJJ 


HAMJ 264 


36 


cagtagatctgcagaagccttcctatctg 


SEQ ID 3 :682- 
700 


BglH 


HAMJ 282 


37 


tcgccaagcttcgttatcgttcaaaccattggg 


SEQJD 

3 : 1060-1081 


Hindm 


HAMJ 283 


38 


ataagaatgcggccgccttactctcctttaataaagccaat 
agtt 


SEQ ID 

3 :2520-2492 


Ndel 


HAMJ 284 


39 


catgccatggacattgatagtctcttgaaacagc 


SEQ ID 3 :856- 
880 


Ncol 


HAMJ 285 


40 


cgccaagcttcttactctcctttaataaagccaatag 


SEQ ID 

3 :2520-2494 


Hindm 


IJ A A/TT OQiC 

xlAMJ ZoO 


A 1 

41 


cgacaagcttaacatggtcgctagcgttacc 


SEQ ID 
3:2139-2119 


Hindm 


HAMJ 287 - 


XI 


cataccatgggcctttatgaggcacctaag 


SEQ ID 

3 :2014-2034 


Ncol 


HAMJ 288 ' 


« i 


cgacaagcttaagtaaatcttcagcctctctcag 


SEQ ID 

i :2376-2353 


Hindm 
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HAMJ289 


44 


gataccatggctagcgaccatgttcaaagaa 


SEQJD 

3 :2125-2146 


Ncol 


HAMJ290 


45 


cgccaagcttatcatccactaacttgactttatcac 


SEQJD 
3:1533-1508 


HindJJJ 


HAMJ291 


46 


cataccatggatattcttgccttcttagctccg 


SEQJD 
3:1531-1554 


Ncol 


HAMJ301 


47 


catgccatggtgcttatgaactaggtttgc 


SEQ ID 3 :59- 
79 


Ncol 


HAMJ302 


48 


cgccaagctttagcgttaccaaaaccattatc 


SEQID 

3 :2128-2107 


Hindlil 


HAMJ 160 


49 


gtattagatctgttcctatgaacttggtcgtcacca 


SEQ ID 13 : 
172-196 


Bglll 


HAMJ 186 


50 


cgcctctagactactgtataggagccgg 


SEQ JD 13: 
2460-2443 


Xbal 


HAMJ292 


51 


catgccatggaaaacatttcaagccttttacgtg 


SEQJD 11: 
754-778 


Ncol 


XlAMJ Zirj 


32 


cgacaagcttctgtataggagccggttgactttc 


SEQJD 11 : 
2457-2434 


HindDI 


HAMJ 294 


53 


catgccatggttcgtaaaaataaggcagaccaag 


SEQJD 11 : 
2038-2062 


Ncol 


HAMJ 297 


54 


catgccatggaagcctattggaatgggaag 


SEQJD 11 : 
622-642 


Ncol 
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EXAMPLE 9 



This example describes the isolation of monoclonal 
antibodies (Mabs) and the use of Mabs to characterize BVH- 
3, BVH-11 and BVH-11-2 protein epitopes. 

Female BALB/c mice (Charles River) were immunized 
subcutaneously with BVH-3 f BVH-11 or BVH-11-2 gene products 
from pneumoniae strain SP64 in presence of 15 ng of 
QuilA adjuvant (Cedarlane Laboratories Ltd, Hornby, 
Canada). One set of mice (fusion experiment 1) were 
immunized on day 1 and 14 with 25 ng of affinity purified 
thioredoxin-His.Tag-BVH-3M fusion protein. A second group 
of mice (fusion experiment 2) were immunized three times at 
three-week intervals with 25 ng of affinity purified 
thioredoxin-His.Tag-BVH-llM. A third group of mice (fusion 
experiment 3) were immunized on day 1 and day 15 with 25 jtg 
of affinity purified thioredoxin-His»Tag-BVH-ll-2M fusion 
protein. A fourth group of mice (fusion experiment 4) were 
immunized on day 1 with 25 ng of affinity purified 
thioredoxin-His.BVH-HB fusion protein and boosted by 
intravenous injection on day 16 and on day 37 with 
recombinant BVH-11B in PBS. Three to four days before 
fusion, mice were injected intravenously with 25 ng of the 
respective antigen suspended in PBS alone. Hybridomas were 
produced by fusion of spleen cells with nonsecreting SP2/0 
myeloma cells as previously described by J. Hamel et al . 
[J. Med. Microbiol., 23, ppl63-170 (1987)]. Culture 
supernatants of hybridomas were initially screened by 
enzyme-linked-immunoassay according to the procedure 
described by Hamel et al. (Supra) using plates coated with 
preparations of purified recombinant proteins or 
suspensions of heat-killed S^ pneumoniae cells. Positive 
hybridomas selected on the basis of EL ISA reactivity with a 
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variety of antigens were then cloned by limiting . dilutions, 
expanded and frozen. 



10 



Hybridomas were tested by EL ISA or Western immunoblotting 
against MUzl and BVH-11 gene products in order to 
characterize the epitopes recognized by the Mabs. BVH-3 
and BVH-11 shared common epitopes with 6 Mabs (H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11) 
showing reactivities with both proteins (Table 4) . BVH-11 
and BVH-11-2 molecules from pneumoniae SP64 shared 
common epitopes not present on BVH-3 with Mabs (3A1, 13C11 
10H10, IDS, 10G9, 10A2, 3E8, 10D7, 2H7 and 6H7) reactive 
with both, BVH-11 and BVH-11-2, recombinant proteins (Table 
5) . 



15 



Table 4 



Reactivity of BVH-3-immunoreactive Mabs with a 





a. Imrnun 


oreactiv 


Ity with 


MAbs 


BVH-3M 
21-1039 


BVH-3 A 
21-509 


BVH-3B 
512-1039 


BVH-3C 
21-225 


NEW2 
472-800 


NEW3 

800-1039 


BVH-11M 
20-840 


H3-1-F9 


+ 


+ 




+ 






+ 


H3-1-D4 


+ 


+ 




+ 






+ 


H3-1-H12 


+ 


+ 




+ 






+ 


H3-2-G2 


+ • 


+ 












H3-3-A1 


+ 


+ 












H3-4-D3 


+ 




+ 






+ 




H11-1-E7 


+ 


+ 




+ 






+ 


Hll-l- 
H10 


+ 


+ 




+ 






+ 


Hll- 
1.1-G11 


+ 


+ 




+ 


+ 




+ 
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Table 5. Reactivity of Mabs raised against BVH-11-2 
protein from L pneumoniae strain SP64 with a panel of BVH- 
JL1 gene products 



Mabs 


b. Immunoreactivity with ~ " " 


C . dVH- 


11 proc 


iucts Id. BVH-11-2 products 


BVH-11M 
20-840 


NEW8 
286-511 


NEW9 
511-713 


BVH-11B 
354-840 


[BVH-11-2 
20-838 


NEW10 
271-838 


NEW11 
699-838 


NEW14 • 
??7-fiQQ 


3A1 


+ 


+ 





+ 


+ 


4. 
i 




+ 


13C1 


+ 


+ 


+ 






i 
i 




+ 


10H10 


+ 


+ 


+ 


+ 








+ 


1D8 


+ 


+ 


- 


+ 


+ 


+ 




+ 


10G9 


+ 


- 




+ 


+ 


+ 




+ 


10A2 


+ 






+ 


+ 


+ 




+ 


3E8 


+ 






+ 




+ 






10D7 










; 


+ 




+ 


2H7 


+ 
















6H7 


+ 








+ 








3A4 








- 


+ 


+ 


+ 




14H6 










+ 


+ 


+ 




7G2 










+ 


+ 




+ 


13H10 










+ 






+ 


7E8 










+ 








7H6 

a lMal 


"IC lictosl in 






- 


+ 









10 



15 



The results obtained from the immunoreactivity studies of 
the Mabs (Table 4 and Table 5) are in agreement with the 
protein sequences derived from the respective gene 
sequences. Indeed the Mabs cross-reactive with BVH-3 and 
BVH-11 molecules recognized BVH-3C protein corresponding to 
the conserved region, and BVH-11 and BVH-11-2 specific Mabs 
were reactive with epitopes located on variable parts of 
these molecules. BVH-3 and BVH-11, and BVH-11 and BVH-11-2 
can be distinguished by their reactivity with Mabs. 



EXAMPLE 10 
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This example illustrates the simultaneous expression of 
mmzl and fflffl-ii gene products by pnevmnoniae. 

A standard Western blot technique was used to investigate 
whether BVH^3 and BVH-11 genes were expressed in S^ 
pneumoniae . pneumoniae strain SP64 and SP63 were grown 

overnight at 37°C in 5% CO, on chocolate agar plates, 
bacteria were suspended in PBS and heat-killed at 56°C for 
20 min. For the preparation of antigens, suspensions of ^ 
pneumoniae were treated with sample buffer containing SDS 
and 2-mercaptoethanol for 5 min at 100°C. Pneumococcal 
protein antigens were resolved by SDS-PAGE electrophoresis 
according to the method of Laemmli [Nature, 227, pp. 680- 
685 (1970)]. After SDS-PAGE, the proteins were transferred 
electrophoretically from the gel to nitrocellulose paper by 
the method of Towbin [Proc. Natl. Acad. Sci. USA, 76, pp 
4350-4354 (1979)] and probed with mouse antiserum or 
monoclonal antibodies. The detection of antigens reactive 
with the antibodies was performed by indirect enzyme- 
immunoassay using conjugated-anti -mouse immunoglobulins and 
a colour substrate, when antiserum raised to recombinant 
BVH-3 was tested against fi, pneumonia S P64 antigens, two 
reactive bands having apparent molecular masses of 127 kDa 
and 99 kDa were detected. Bands having the same apparent 
molecular masses were also detected when Mabs H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11 were 
used individually as immunological probes. in contrast, 
Mabs specific for the BVH-3 molecule detected the 127 kDa 
band only and Mabs specific for BVH-11 detected the 99 kDa 
band only thus confirming the identity of the 127 and 99 
kDa bands as BVH-3 and BVH-11, respectively. These studies 
provide evidence that BVH-3 and BVH-11 proteins are 
simultaneously present on S^. pneumonia Moreover, the 
results are consistent with our previous observations that 
BVH-3 and BVH-11 possess epitopes that are common to both 
proteins and epitopes that are exclusive to either protein 
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15 



In S,. pneumoniae SP64, mature BVH-3, BVH-11 and BVH-11-2 
are proteins of 1019, 821 and 819 amino acids with 
predicted molecular mass of 112.5 kDa, 92.4 kDa, and 91.7 
kDa, respectively. Although there is a discrepancy between 
the molecular mass predicted from the sequence and the 
molecular mass calculated on SDS-PAGE, BVH-3 can be 
distinguished from BVH-11 by its higher molecular mass. 
Moreover, BVH-3 molecules from S,. pneumoniae strain SP63 
have an apparent molecular mass of 112 kDa in SDS-PAGE 
compared to 127 kDa for BVH-3 of SP64 strain. This data is 
consistent with the deletion of a stretch of 177 amino acid 
residues in BVH-3 of ^ pneumoniae strain SP63 . 



EXAMPLE 11 



20 



30 



This example describes the protection conferred in 
experimental infection of mice vaccinated with recombinant 
BVH-3 or BVH-11 gene products. 



Groups of 7 or 8 female BALB/c mice (Charles River) were 
immunized subcutaneous ly three times at three-week 
intervals with either affinity purified thioredoxin- 
25 His.Tag-BVH-3M fusion protein, affinity purified 

thioredoxin-His.Tag-BVH-HM fusion protein or, as control, 
with QuilA adjuvant alone in PBS. Twelve to 14 days 
following the third immunization, the mice were challenged 
intravenously with pneumoniae WU2 strain or intranasally 
with P4241 strain. Samples of the S, pneumonia challenge 
inoculum were plated on chocolate agar plates to determine 
the CFU and to verify the challenge dose. The challenge 
dose was approximately 10 s CFU. Deaths were recorded for a 
period of 14 days and on day 14 post-challenge, the 
surviving mice were sacrificed and blood samples tested for 
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the presence of S^. pneumoniae organisms. The survival data 
are shown in Tables 6 and 7 . 



Table 6. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental infection with virulent 
pneumoniae WU2 



Experiment 


Immunogen 


Alive : 


dead* 


Median days alive 


1 


BVH-3M 


8 


: 0 




>14 




none 


0 : 


8 




1 


2 


BVH-11M 


8 : 


0 




>14 




none 


0 : 


8 




1 



14 post-challenge, 



the number of mice dead on day 



Table 7. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental pneumonia with virulent 
pneumoniae P4241 



Experiment 


Immunogen 


Alive : 


dead 4 


Median day alive 


1 


BVH-3M 


6 


: 1 




>14 




none 


1 


7 




4.5 


2 


BVH-3M 


8 : 


0 




>14 




BVH-11M 


8 : 


0 




>14 




none 


0 : 


8 




4 



14 post-challenge. 



the number of mice dead on day 



All mice immunized with recombinant BVH-3M or BVH-11M 
protein survived to infection with WU2 while none of the 
control mice given adjuvant alone survived. All except one 
mice immunized with recombinant BVH-3M or BVH-11M protein 
survived to infection with P4241 while only one control 
mice given adjuvant alone survived. All hemocultures from 
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surviving mice were negative at day 14 post-challenge. 
These results clearly indicate that both, BVH-3M and BVH- 
11M, elicit protective anti -pneumococcal immune responses 
in mice. The fact that these proteins are highly conserved 
among JL. pneumoniae isolates emphasize the potential of 
BVH-3 and BVH-11 as universal vaccine candidates. Indeed, 
the BVH-3 and BVH-11 proteins from serogroup 6 S. 
pneumoniae strain SP64 elicited protection against 
pneumococcal infections with strains of different capsular 
serotypes . 

Ideally, a vaccine that could protect against pneumococcal 
disease, could protect against meningitis, otitis media, 
bacteremia and pneumonia. BVH-3 and BVH-11 were protective 
against lethal systemic- and pneumonia-infection models 
thus suggesting that, in humans, BVH-3- and BVHll-protein- 
based vaccines could reduce the incidence of a wide 
spectrum of disease caused by virtually all S^. pneumoniae 
independently of the capsular serotype. 

Data from Tables 6 and 7 clearly demonstrate that BVH-3 and 
BVH-11 were, both, protection-eliciting molecules of £L_ 
pneumoniae . It was not known, however, whether protection 
can be mediated by specific sequences that were not shared 
on BVH-3 and BVH-11 molecules. Groups of female BALB/c 
mice (Charles River) were immunized subcutaneous ly three 
times at three-week intervals with either affinity purified 
thioredoxin-His»Tag- BVH-3 AD, -BVH-3B or -BVH-3C fusion 
protein in presence of 15 ng of QuilA adjuvant (Cedarlane 
Laboratories Ltd, Hornby, Canada) . Control mice were 
immunized with QuilA adjuvant alone in PBS or affinity 
purified thioredoxin-His»Tag or thioredoxin-His^Tag- fusion 
protein (His-Thio) in presence of QuilA. 

To determine the protective ability of a set of truncated 
proteins, termed NEW4, NEW5, NEW6, NEW7, NEW8, NEW9, NEW10, 



59 



WO 00/39299 

PCT/CA99/01218 

NEW11, NEW14 and BVH-11B, groups of female BALB/c mice 
(Charles River) were immunized subcutaneous ly two times at 
three-week intervals with 25 /xg of either affinity purified 
His«Tag-fusion protein in presence of 15 ng of QuilA 
adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent pneumoniae Our 
results indicate that, BVH-3B, a truncated BVH-3 molecule 
consisting of amino acids 512-1039, elicited protection 
against the mouse- virulent strains WU2 and P4241. 
Similarly, BVH-11B, NEW4 and NEW5 molecules, three 
truncated BVH-11 molecules consisting of amino acids 354- 
840, amino acids 286-840 and amino acids 286-713, 
respectively, elicited protection against experiment 
intravenous challenge with WU2 and intranasal challenge 
with P4241. Moreover, vaccination with NEW10 and NEW14, 
consisting of amino acids 272-838 and amino acids 227-699 
from BVH-11-2 molecule also resulted in protection against 
death with the pneumococcal strains. These results 
indicate that the region comprising 428 amino acids 
extending from amino acids 286-713 and amino acids 272-699 
on pneumoniae SP64 BVH-11 and BVH-11-2 protein 
sequences, respectively, contains protective epitopes. 
This region is highly conserved with a global 91% identity 
and 94% homology among thirteen BVH-11 protein sequences. 



60 



WO00/39299 



PCT/CA99/01218 



Table 8. Evaluation of protection elicited by vaccination 
of mice with BVH-3 and BVH-11 gene products 







Challenge with WU2 


Challenge with P4241 


Experiment 


Immunogen 


Alive : dead 8 


Median day 


1 Alive : dead 


Median day 


- b .. 






alive 




alive 


1 


None 


0 : 8 


1.5 


1 ; 7 


4.5 




NEW4 


8 : 0 


>14 


8 : 0 


>14 




NEW5 


8 : 0 


>14 


8 : 0 


>14 




NEW7 


0 : 8 


2 


0 : 8 


5 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


2" 


None 


0 : 8 


1 


0-8 


A 




NEW5 


8 : 0 


>14 


8 : 0 


>14 




NEW8 


0 : 8 


1.5 


0 : 8 






NEW9 


3 : 5 


3.5 


2 : 6 


7 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


3" 


None 


0 : 8 


1 


0 • ft 


A 




NEW6 


0 : 8 


1 


4 • 4 






NEW10 


8 : 0 


>14 


8:0 


>14 




NEW11 


0 : 8 


1.5 


1 : 7 






BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


4° 


None 


0 : 8 


2 


0 : 8 


4 




BVH-11B 


7 : 1 


>14 


8 : 0 


>14 




NEW14 


8 : 0 


>14 


8 : 0 


>14 


5 


His-Thio 


0 : 8 


2 








BVH-3 AD 


1 : 7 


2.5 








BVH-3B 


5 : 3 


>14 






6 


His-Thio 


0 : 8 


1 








BVH-3C 


0 : 8 









he number of mice dead on day 
14 post-challenge. 

b The WU2 challenge dose was 10 s CFU. 

c Mice living longer than 14 days were assigned a survival 
time of 14 days for the determination of median values. 
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EXAMPLE 12 
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This example described the cloning and expression of a 
chimeric gene encoding for a chimeric polypeptide 
corresponding to the carboxy- terminal region of BVH-3 in 
fusion at the C end to the carboxy -terminal region of BVH- 
11 and the additive protection observed after vaccination 
with a chimeric polypeptide. 

It is clear from the studies described above that BVH-3 and 
BVH-11 are serologically distinct molecules simultaneously 
present on pneumoniae. The results of immunological 
studies of mice indicate that both proteins are good 
vaccine candidates. These proteins have the potential to 
provide protection against all pneumococci, regardless of 
serotype. Even though the two proteins share epitopes and 
sequences, they have different characteristics and may 
serve different biological functions. Thus, immunization 
against the two proteins may provide a higher level of 
protection than that imparted by each individually. To 
examine this, several avenues where full-length or 
truncated BVH-3 and BVH-11 are administered in combination 
or in conjugation can be explored. Here we describe the 
genetic engineering of a BVH-3-BVH-11 fusion gene and 
protein, termed NEW12 (SEQ ID NO: 76 and SEQ ID NOt58, 
respectively), and the potential use of NEW12 protein as a 
vaccine. 

BVH " 3 311(1 SVH^ll gene fragments corresponding to the 3 'end 
of the genes were amplified by PCR using pairs of 
oligonucleotides engineered to amplify fragments spanning 
nucleotides 1414 to 3117 (SEQ ID NOi 1) and nucleotides 1060 
to 2520 (SEQ ID NO* 3) from ^ pneumoniae strain SP64 BVH-3 
and BVH-11 genes, respectively. The primers used, HAMJ278 
and HAMJ279; HAMJ282 and HAMJ283 had a restriction 
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endonuclease site at the 5' end, thereby allowing 
directional in-frame cloning of the amplified product into 
the digested pET21b(+) plasmid vector (Table 2). PCR- 
amplified products were digested with restriction 
endonucleases and ligated to linearized plasmid pET21b{ + ) 
vector digested likewise. The resultant plasmid constructs 
were confirmed by nucleotide sequence analysis. The 
recombinant pET21b( + ) plasmid containing the Ndel-Hindlll 
BVH-3 PCR product was linearized by digestion with the 
restriction enzymes Hindlll and NotI for the in- frame 
cloning of the Hindlll-NotI DNA fragment obtained from the 
recombinant pET21(+) vector containing the BVH-11 gene 
fragment. Clones were first stabilized in E. coli DH5ot 
before introduction into L coli BL21(XDE3) for expression 
of a chimeric pneumococcal protein molecule. The 
recombinant chimeric polypeptide, termed NEW 12, was 
expressed as C-terminal fusion with an His-tag. The 
expressed recombinant NEW 12 protein was purified from 
supernatant fractions obtained from centrifugation of 
sonicated IPTG-induced E, coli cultures using a His-Bind 
metal chelation resin (QlAgen, Chatsworth, CA) . 

According to the same procedure described above, it is 
possible to construct other chimeric polypeptides, as a 
result of a simultaneous expression of New 1 and New 4, New 
1 and New 5, New 1 and New 10, or New 1 and New 14. The 
construction can be with New 1 upstream or downstream of 
New 4, New 5, New 10, BVH-llB or New 14. It is also 
possible to construct other chimeric polypeptides as a 
result of a simultaneous expression of more than two 
fragments of either genes of BVH-3, BVH-11 or BVH-11-2. 

Groups of 8 female BALB/c mice (Charles River) were 
immunized subcutaneous ly two times at three-week intervals 
with 25 of either affinity purified His«Tag- fusion NEW1, 
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BVH-11B or NEW12 protein in presence of 15 ng of QuilA 
adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent pneumoniae . As 
demonstrated before, NEW1 and BVH-11B molecules comprising 
5 amino acids 472 to 1039 from BVH-3 protein and amino acids 
354-840 from BVH-11 protein, respectively, correspond to 
portions of the proteins capable of eliciting a protective 
immune response. To determine if a chimeric polypeptide 
would significantly improve the protection compared with 

10 those seen for the individual counterparts, the challenge 
dose was adjusted in a manner that protection was not 
expected with NEW1 and BVH-11B molecules. Interestingly, 
the chimeric NEW12 protein, elicited protection against the 
mouse-virulent strains WU2 and P4241. Seven out of 8 mice 

15 immunized with NEW12 were still alive 10 days after the 

challenge while 28 out of 32 mice immunized with NEW1, BVH- 
11B, BVH-3M or adjuvant alone were dead by five days post- 
challenge. Thus, vaccination of mice with NEW12 provided 
the highest degree of protection against WU2 challenge. 
20 These results indicate that immunization with a chimeric 
polypeptide and possibly a combination of BVH-3 and BVH-11 
gene products can provide additional protection to that 
obtained by administration of BVH-3 or BVH-11 antigens 
alone . 

25 

Table 9. Evaluation of protection elicited by vaccination 
of mice with the chimeric NEW12 molecule 





Challenge with WU2 


Challenge with P4241 


Immunogen 


Alive : dead 3 


Median day 


Alive : dead 


Median day 






alive 




alive 


None 


0:8 


1 


0 : 8 


5 


NEW1' 


2 : 6 


2 


1 : 7 


8 


BVH-11B 


1 : 7 


3.5 


8 : 0 


>14 


NEW12 


6 : 2 


>14 


7 : 1 


>14 


BVH-3M 


1 : 7 


3 


8 : 1 


>14 
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EXAMPLE 13 



5 



10 



15 



20 



25 



30 



This example illustrates the identification of additional 
BVH^l and fiHfcli related sequences in Streptococcus species 
other than £L_ pneumoniae 

It was previously shown that BVH-3, BVH-11 and BVH-11-2 are 
a family of related proteins sharing common sequences. 
Homology searches were performed with the nucleotide 
sequence from the conserved region of these genes and 
compared with GenBank and . EMBL sequences using FASTA . The 
most significant homology was observed with a 2.469-kb gene 
coding for a calculated 92-kDa protein (SEQ id NO» 81) of 
unknown function in agalactia also called group B 
streptococcus or GBS. The gene was designated BVH-71. A 
protein demonstrating 99.2% identity and 99.5% similarity 
with that of GBS was also identified in ^ P voa 6naH also 
called group A streptococcus or GAS (SEQ ID NO a 83). The 5' 
region of the BVH-71 sequences (SEQ ID NO: 80 and SEQ ID 
NO* 82), spanning nucleotides 1 to 717, demonstrated 58 and 
60% identity with the conserved regions of BVH-3 
(nucleotides 1 to 675) and BVH-11 (nucleotides 1 to 684) 
genes respectively. The first 239 amino acids of the 
translated sequences of the GBS and GAS BVH-71 open reading 
frames are 51 and 54% identical to the first 225 and 228 
amino acids of BVH-3 and BVH-11, respectively, m addition 
to structural similarities, streptococcal BVH-3, BVH-11 and 
BVH-71 proteins also share antigenic epitopes. A 97-kDa 
band was revealed on Western blots of GAS or GBS whole 
cells, using Mab Hll-l.i-Gll reactive with the BVH-3 and ■ 
BVH-11 conserved regions. Similarly, GAS and GBS 
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recombinant BVH-71 proteins were detected in Western 
immunoblot analysis. 

These results indicate that BVH-71, BVH-3 and BVH-11 
proteins might share similar functions. Our results also 
suggest that BVH-71 proteins can be used as protein vaccine 
components of anti-streptococcus. In a further embodiment 
BVH-71 proteins can be used as protein vaccine components 
of anti-GAS or anti-GBS vaccines. 
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What is claimed is, 



1. 



An isolated polynucleotide encoding a polypeptide having 
at least 70% identity to a second polypeptide having a 
sequence chosen from: SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 
55 to 75, 77 to 79, 81, 83 or fragments, analogs or .' 
derivatives thereof. 

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 95% 
identity to the second polypeptide. 

3. An isolated polynucleotide encoding a polypeptide capable 
of generating antibodies having binding specificity for a 
polypeptide having a sequence chosen from: SEQ ID NOs: 2, 
4, 6, 8, 10, 14, 16, 55 to 75, 77 to 79, 81, 83 or 

fragments, analogs or derivatives thereof. 



4 



An isolated polynucleotide that is complementary to the 
polynucleotide of claim 1. 



5 . An isolated polynucleotide that is complementary to the 
polynucleotide of claim 3. 

6. The polynucleotide of claim 1, wherein said 
polynucleotide is DNA. 

7. The polynucleotide of claim 3, wherein said 
polynucleotide is DNA. 

8. The polynucleotide of claim 1, wherein said 
polynucleotide is RNA. 

9. The polynucleotide of claim 3, wherein said 
polynucleotide . is RNA. 
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10. A vector comprising the polynucleotide of claim 1 
wherein said DNA is operably linked to an expression 
control region. 

11. A vector comprising the polynucleotide of claim 3, 
wherein said DNA is operably linked to an expression 
control region. 

12. A host cell transfected with the vector of claim 10. 

13. A host cell transfected with the vector of claim 11. 

14. A process for producing a polypeptide comprising 
culturing a host cell according to claim 12 under 
conditions suitable for expression of said polypeptide. 

15. A process for producing a polypeptide comprising 
culturing a host cell according to claim 13 under 
condition suitable for expression of said polypeptide 



16, 



17 



An isolated polypeptide having at least 70% identity to a 
second polypeptide having an amino acid sequence chosen . 
from: SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

An isolated polypeptide, capable of generating antibodies 
having binding specificity for a second polypeptide 
having a sequence chosen from: SEQ id NOst 2, 4, 6 8 
10, 14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, ' 
analogs or derivatives thereof. 

18. An isolated polypeptide having an amino acid sequence 

chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof. 
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19. An isolated polypeptide according to claim 18, wherein the 
N- terminal Met residue is deleted. 

20. An isolated polypeptide according to claim 18, wherein the 
secretory amino acid sequence is deleted. 



21 



22 



A chimeric polypeptide comprising two or more 
polypeptides chosen from SEQ ID NOs, 2, 4, 6/ 8, 10 14 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or' 
derivatives thereof; provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as 
to form a chimeric polypeptide. 

A chimeric polypeptide comprising two or more polypeptides 
chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 

72, 74, 77 or fragments, analogs or derivatives thereof- 
•provided that the polypeptides or fragments, analogs or' 
derivatives thereof are linked as to form a chimeric 
polypeptide. 

23. A chimeric polypeptide of formula (I) ; 

A -<B) m -(C) a -D (I) 

Wherein; 

m is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ ID NOs, 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

B is chosen from SEQ ID NOs, 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

C is chosen from SEQ id NOs, 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; and 

D is chosen from SEQ id NOs, 2, 4, 6, 8, 10, 14, 16, 55 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

A chimeric polypeptide of formula (I) : 
A-(B)„-( C ) n -D (I) 
Wherein ; 
m is 0 or 1, 
n is 0 or 1, 

A is chosen from SEQ id NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; 

B is chosen from SEQ ID NOs tlQ, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77, or fragments, analogs or derivatives 
thereof ; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; and 

D is chosen from SEQ id NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof . 

A vaccine composition comprising a polypeptide according 
to any one of claims 16 to 24 and a pharmaceutical^ 
acceptable carrier, diluent or adjuvant. 

26. A method for therapeutic or prophylactic treatment of 

meningitis, otitis media, bacteremia or pneumonia 

infection in an individual susceptible to meningitis, 

otitis media, bacteremia or pneumonia infection comprising 

administering to said individual a therapeutic or 

prophylactic amount of a composition according to claim 
25 . 



25 



27. A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an individual 
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susceptible to streptococcal infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 



25. 



A method according to claim 26, wherein said individual is 
a mammal . 

A method according to claim 27, wherein said individual is 
a human. 

A method according to claim 22, wherein said bacterial 
infection is S. pneumoniae, group A streptococcus 
(pyogenes) , group B streptococcus (GBS or agalactiae) , 
dysgalactiae. uteris, nocardia or Staphylococcus aureus. 

A method according to claim 26, wherein said bacterial 
infection is S. pneumonia e 

Use of a vaccine composition according to claim 25 for 
the prophylactic or therapeutic treatment of 
Streptococcal infection in an animal susceptible to or 
infected with streptococcal infection comprising 
administering to said animal a prophylactic or 
therapeutic amount of the composition. 



I 
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ATGAAATTTA 
TGTGCCTATG 
TATGTGGATG 
CAGAAAGAAG 
ACGTCACACG 
AGTGAAGAAC 
GAAGTCAAGG 
GCAGCTCATG 
GTCAAAGATA 
ACGACAAATG 
TATATCGTTC 
GAATTAGCAG 
TATTCTTCAA 
CCAGCAAATA 
GCCCAACGTT 
ACACCAAATG 
CTTTCTGCTT 
GTTTCTACAA 
CCTTCTTCTT 
CCAAAAGATA 
CATTACATTC 
ACACCTTCTC 
GGATACGGAT 
CACGGAGACC 
GCGCAAAAAC 
CATGAACAGG 
GAAAAAATTG 
AAAGAAAAAA 
GAACATAAAC 
GAAGGAGTTG 
GTTAATTTGT 
AAACGCGTTT 
GTAAAATTAA 
GAAGGAGTAG 
TTTAAGTATA 
GTTCCAACCT 
GGGGATACTT 
GTCAGAGTGT 
GAAATCAAAT 
ATTCCTGTAA 
GTACCTATCT 
AGGAATAAAG 
GAGAAGGTAG 
GAAGAAGTTC 
GGGATGAAGC 
TCAGGAGAAG 
GGTGAAAATA 
ACAGAAAATA 
GAAAACTCAA 
TTAGATCCA6 
TTTACAGCTA 
GAATTAAGAT 
(SEQ ID NO i 



GTAAAAAATA 
CACTAAACCA 
GCAGCCAGTC 
GAATTCAGGC 
GTGACCACTA 
TCTTGATGAA 
GTGGTTATAT 
CTGATAATGT 
ATGAGAAGGT 
ATGGTTATGT 
CTCATGGAGG 
CAGCTAAAGC 
CAGCTAGTGA 
AATCTGAAAA 
ACAGTGAATC 
GAGTTGCGAT 
TAGAAGAAAA 
ATGCAAAACC 
TAACGACAAG 
TCGTTGAAGA 
CAAAATCAAA 
CATCTCTTCC 
TTGATGCTAA 
ACAATCATTA 
ATTTAGAGGA 
ATTATCCAGG 
CTGGCATTAT 
ATGCGATTAT 
CGGTTGGAAT 
CTAAAAAAGA 
TAAAAAATAG 
CTTTTAGTTT 
TAACACCAGA 
GGAATATTGC 
CTATCGCTTC 
CTTTAGCTTA 
ATTTAAGAGT 
TTGATGAATT 
TACCGATTCC 
CCTTCATGGC 
TGGAAAAAGA 
CACAAGAAAA 
AAAAAGAAAA 
CTACAGTGGA 
TAGAAAATGT 
TCATTAAAAA 
AACCATCTGA 
AACCAGCAGA 
CGGATAATGG 
CATTAGAGGA 
GTTACGGATT 
TGCCAAGTGG 
1) 



TATAGCAGCT GGATCAGCTG 
GCATCGTTCG CAGGAAAATA 
AAGTCAGAAA AGTGAAAACT 
TGAGCAAATT GTAATCAAAA 
TCATTACTAT AATGGGAAAG 
GGATCCAAAC TATCAACTTA 
CATCAAGGTC GATGGAAAAT 
TCGAACTAAA GATGAAATCA 
TAACTCTAAT GTTGCTGTAG 
CTTTAATCCA GCTGATATTA 
TCACTATCAC TACATTCCCA 
ACATCTGGCT GGAAAAAATA 
CAATAACACG CAATCTGTAG 
TCTCCAGAGT CTTTTGAAGG 
AGATGGCCTG GTCTTTGACC 
TCCGCATGGC GACCATTACC 
GATTGCCAGA ATGGTGCCTA 
TAATGAAGTA GTGTCTAGTC 
TAAGGAGCTC TCTTCAGCAT 
AACGGCTACA GCTTATATTG 
TCAAATTGGG CAACCGACTC 
AATCAATCCA GGAACTTCAC 
TCGTATTATC GCTGAAGATG 
TTTCTTCAAG AAGGACTTGA 
AGTTAAAACT AGTCATAATG 
TAATGCCAAA GAAATGAAAG 
GAAACAATAT GGTGTCAAAC 
TTATCCGCAT GGAGATCACC 
TGGTCATTCT CACAGTAACT 
AGGGAATAAA GTTTATACTG 
TACGTTTAAT AATCAAAACT 
TCCGCCTGAA TTGGAGAAAA 
TGGAAAAGTA TTGGAGAAAG 
AAACTTTGAA TTAGATCAAC 
AAAAGATTAT CCAGAAGTAA 
CAAAATGGCC AGTCAAACGA 
GAACCCTCAA TTTGCAGTGC 
TCATGGAAAT GCTTATTTAG 
GAAATTAAAC CAAGGAACAA 
AAATGCTTAT TTGGACAATC 
AAATCAAACT GATAAACCAA 
CTCAAAACTT GATGAAAAGG 
ACTTTCTGAA ACTGGGAATA 
TCCTGTACAA GAAAAAGTAG 
CTTGTTTAAT ATGGACGGAA 
GAATATGGCA GATTTTACAG 
AAATGGAAAA GTATCTACTG 
TTCTTTACCA GAGGCACCAA 
AATGTTGAAT CCAGAAGGGA 
AGCTCCAGCA GTAGATCCTG 
AGGCTTAGAT AGTGTTATAT 
AGAAGTGATA AAAAAGAATT 
FIGURE 1 



TTATCGTATC 
AGGACAATAA 
TGACACCAGA 
TTACAGATCA 
TTCCTTATGA 
AAGACGCTGA 
ATTATGTCTA 
ATCGTCAAAA 
CAAGGTCTCA 
TCGAAGATAC 
AAAGCGATTT 
TGCAACCGAG 
CAAAAGGATC 
AACTCTATGA 
CTGCTAAGAT 
ACTTTATTCC 
TCAGTGGAAC 
TAGGCAGTCT 
CTGATGGTTA 
TAAGACATGG 
TTCCAAACAA 
ATGAGAAACA 
AATCAGGTTT 
CAGAAGAGCA 
GATTAGATTC 
ATTTAGATAA 
GTGAAAGTAT 
ATCATGCAGA 
ATGAACTGTT 
GAGAAGAATT 
TTACTCTAGC 
AATTAGGTAT 
TATCTGGTAA 
CTTATTTACC 
GTTATGATGG 
TTTTCTATCC 
CTAAAGGAAC 
AAAATAACTA 
CCAGAACGGC 
AATCGACTTA 
GTATTCTACC 
TAGAAGAACC 
GTACTAGTAA 
CAAAATTTGC 
CAATTGAATT 
GAGAAGCACC 
GAACAGTTGA 
ACGAAAAACC 
ATGTGGGGAG 
TACAAGAAAA 
TCAATATGGA 
TATCTGATTT 



CTTGAGTCTA 
TCGTGTCTCT 
CCAGGTTAGC 
GGGCTATGTA 
TGCCCTCTTT 
TATTGTCAAT 
CCTGAAAGAT 
ACAAGAACAT 
GGGACGATAT 
GGGTAATGCT 
ATCTGCTAGT 
TCAGTTAAGC 
AACTAGCAAG 
TTCACCTAGC 
TATCAGTCGT 
TTACAGCAAG 
TGGTTCTACA 
TTCAAGCAAT 
TATTTTTAAT 
TGATCATTTC 
TAGTCTAGCA 
TGAAGAAGAT 
TGTCATGAGT 
AATTAAGGCT 
TTTGTCATCT 
AAAAATCGAA 
TGTCGTGAAT 
TCCGATTGAT 
TAAACCCGAA 
AACGAATGTT 
CAATGGTCAA 
CAATATGCTA 
AGTATTTGGA 
AGGACAAACA 
TACATTTACA 
TTTCCATGCA 
TGATGCTTTA 
TAAAGTTGGT 
CGGAAATAAA 
TATTGTGGAA 
ACAATTTAAA 
AAAGACTAGT 
TTCAACGTTA 
TGAAAGTTAT 
ATATTTACCA 
TCAAGGAAAT 
GAACCAACCA 
TGTAAAACCA 
TGACCCTATG 
ATTAGAAAAA 
TGGAACGATT 
CATAGCGTAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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MKFSKKYIAA GSAVIVSLSL CAYALNQHRS 
SENLTPDQVS QKEGIQAEQI VIKITDQGYV 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV 
DEINRQKQEH VKDNEKVNSN VAVARSQGRY 
YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA 
QSVAKGSTSK PANKSENLQS LLKELYDSPS 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR 
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN 
HYIPKSNQIG QPTLPNNSLA TPSPSLPINP 
AEDESGFVMS HGDHNHYFFK KDLTEEQIKA 
HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY 
GDHHHADPID EHKPVGIGHS HSNYELFKPE 
VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE 
LEKVSGKVFG EGVGNIANFE LDQPYLPGQT 
VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ 
AYLENNYKVG EIKLPIPKLN QGTTRTAGNK 
VPILEKENQT DKPSILPQFK RNKAQENSKL 
TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY 
SGEVIKKNMA DFTGEAPQGN GENKPSENGK 
EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM 
FTASYGLGLD SVIFNMDGTI ELRLPSGEVI 



QENKDNNRVS 


YVDGSQSSQK 


50 


TSHGDHYHYY 


NGKVPYDALF 


100 


DGKYYVYLKD 


AAHADNVRTK 


150 


TTNDGYVFNP 


ADIIEDTGNA 


200 


GKNMQPSQLS 


YSSTASDNNT 


250 


AQRYSESDGL 


VFDPAKIISR 


300 


MVPISGTGST 


VSTNAKPNEV 


350 


PKDIVEETAT 


AYIVRHGDHF 


400 

^* V w 


GTSHEKHEED 


GYGFDANRII 


t 3U 


AQKHLEEVKT 


SHNGLDSLSS 


cnn 


GVKRESIWN 


KEKNAIIYPH 


•J 3 \J 


EGVAKKEGNK 


VYTGEELTNV 


600 


LEKKLGINML 


VKLITPDGKV 


650 


FKYTIASKDY 


PEVSYDGTFT 


700 


FAVPKGTDAL 


VRVFDEFHGN 


750 


IPVTFMANAY 


LDNQSTYIVE 


800 


DEKVEEPKTS 


EKVEKEKLSE 


850 


GMKLENVLFN 


MDGTIELYLP 


900 


VSTGTVENQP 


TENKPADSLP 


950 


LDPALEEAPA 


VDPVQEKLEK 


1000 


KKNLSDFIA ( 


SEQ ID NO: 2) 


1039 



FIGURE 2 
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ATGAAAATCA 
GCTTATQAAC 
ATAGATGGAA 
CGTGAAGGAA 
TCTCATGGAG 
GAAGAGCTCC 
ATCAAGGGTG 
GCTCATGCGG 
CAGCATCGTG 
GGACGCTACA 
GGCGATGCCT 
TCAGCTAGCG 
TTAAGAACCT 
GTAAGCAATC 
GCAAGTCAAA 
CAACGCCATG 
GCCAGAGGTG 
TCTGAATTGG 
GTACCAGATT 
CCGCAACCTG 
AAAGAAGCTG 
TATATCCCAG 
AAGCAGGAAA 
CGAGAATTTT 
AATAAAGGTC 
GTCTCAAGTG 
CATCCAGAAC 
GTAGCCAAGT 
ATAACCAGTG 
AAAAAAGATA 
GGTTTGACCC 
GAAGCTATCT 
AATCTTCAAT 
TACCATAACA 
ACTCTTGAGG 
CATTCAGATA 
GCTGATACCA 
GAAGAAACCC 
GAGGAACCAG 
GTTGAAGAAA 
AAGTCCAATG 
GACAACAATA 
TAA (SEQ 



ATAAAAAATA 
TAGGTTTGCA 
AACAAGCGAC 
TCAACGCCGA 
ACCATTATCA 
TCATGAAAGA 
GTTATGTCAT 
ATAATGTCCG 
AAGGAGGGAC 
CCACAGATGA 
ATATCGTTCC 
AGTTGGCTGC 
ATCGCCGACA 
CAGGAACTAC 
GTAATGACAT 
TAGAATCTGA 
TAGCTGTCCC 
AAAAACGAAT 
CAAGACCAGA 
GACCAAATCC 
TTCGAAAAGT 
CCAAGAATCT 
GTTTATCTCA 
ACAATAAGGC 
GACAAGTTGA 
ATAAAGTCAA 
GTTTAGGAAA 
TGGCAGGCAA 
ATGAGGGGGA 
GTTTGTCTGA 
CTCCTTCGAC 
ACAACCGCGT 
ATACTGTAGA 
TCAAATTTGA 
ATCTTTTGGC 
ATGGTTTTGG 
ATCAAACGGA 
CTCGAGAAGA 
AAGAAGAATC 
AACTGAGAGA 
CCAAAGAGAC 
CTATTATGGC 
ID NO: 3) 



TCTAGCTGGG 
TCAAGCTCAA 
GCAAAAAACG 
ACAAATCGTC 
TTACTATAAT 
TCCGAATTAT 
TAAGGTAAAC 
TACAAAAGAA 
TTCAGCAAAC 
TGGTTATATC 
TCATGGAGAT 
TGCAGAAGCC 
AAATAGCGAT 
AAATACTAAC 
TGATAGTCTC 
TGGCCTTATT 
TCATGGTAAC 
TGCTCGTATT 
AGAACCAAGT 
TCAACCAGCT 
AGGCGATGGT 
TTCAGCAGAA 
TAAGCTAGGA 
TTATGACTTA 
TTTTGAGGCT 
GTTAGTGGAT 
ACCAAATGCG 
GTACACAACA 
TGCCTATGTA 
AGCTGAGAGA 
AGACCATCAG 
GAAAGCAGCT 
AGTCAAAAAC 
GTGGTTTGAC 
GACTGTCAAG 
TAACGCTAGC 
AAAACCAAGC 
GAAACCACAA 
ACCAGAGGAA 
GGCTGAAGAT 
TCTCACAGGA 
AGAAGCTGAA 



TCAGTAGCTA 
ACTGTAAAAG 
GAGAATTTGA 
ATCAAGATTA 
GGCAAGGTCC 
CAGTTGAAGG 
GGTAAATACT 
GAAATCAATC 
GATGGTGCGG 
TTCAATGCAT 
CATTACCATT 
TTCCTATCTG 
AACACTCCAA 
ACAAGCAACA 
TTGAAACAGC 
TTCGACCCAG 
CATTACCACT 
ATTCCCCTTC 
CCACAACCGA 
CCAAGCAATC 
TATGTCTTTG 
ACAGCAGCAG 
GCTAAGAAAA 
CTAGCAAGAA 
TTGGATAACC 
GATATTCTTG 
CAAATTACCT 
GAAGACGGTT 
ACTCCACATA 
GCGGCAGCCC 
GATTCAGGAA 
AAGAAGGTGC 
GGTAGTTTAA 
GAAGGCCTTT 
TACTATGTCG 
GACCATGTTC 
GAGGAGAAAC 
AGOGAGAAAC 
TCAGAAGAAC 
TTACTTGGAA 
TTAAAAAATA 
AAACTATTGG 



CACTTGTTTT 
AAAATAATCG 
CTCCTGATGA 
CGGATCAAGG 
CTTATGATGC 
ATTCAGACAT 
ATGTTTACCT 
GGCAAAAACA 
TAGCCTTTGC 
CTGATATCAT 
ACATTCCTAA 
GTCGGGAAAA 
GAACAAACTG 
ACAGCAACAC 
TCTACAAACT 
CGCAAATCAC 
TTATCCCTTA 
GTTATCGTTC 
CTCCAGAACC 
CAATTGATGA 
AGGAGAATGG 
GCATTGATAG 
CTGACCTCCC 
TTCACCAAGA 
TGTTGGAACG 
CCTTCTTAGC 
ACACTGATGA 
ATATCTTTGA 
TGACCCATAG 
AGGCTTATGC 
ATACTGAGGC 
CACTTGATCG 
TCATACCTCA 
ATGAGGCACC 
AACATCCAAA 
AAAGAAACAA 
CTCAGACAGA 
CAGAGTCTCC 
CTCAGGTCGA 
AAATCCAGGA 
ATTTACTATT 
CTTTATTAAA 



AAGTGTCTGT 
TGTTTCCTAT 
GGTTAGCAAG 
TTATGTGACC 
CATCATCAGT 
TGTCAATGAA 
TAAGGATGCA 
AGAACATAGT 
ACGTTCACAG 
CGAAGATACG 
GAATGAGTTA 
TCTGTCAAAT 
GGTACCTTCT 
TAACAGTCAA 
GCCTTTGAGT 
AAGTCGAACC 
TGAACAAATG 
AAACCATTGG 
TAGTCCAAGT 
GAAATTGGTC 
AGTTTCTCGT 
CAAACTGGCC 
ATCTAGTGAT 
TTTACTTGAT 
ACTCAAGGAT 
TCCGATTCGT 
TGAGATTCAA 
TCCTCGTGAT 
CCACTGGATT 
TAAAGAGAAA 
AAAAGGAGCA 
TATGCCTTAC 
TTATGACCAT 
TAAGGGGTAT 
CGAACGTCCG 
AAATGGTCAA 
AAAACCTGAG 
AAAACCAACA 
GACTGAAAAG 
TCCAATTATC 
TGGCACCCAG 
GGAGAGTAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2523 
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MKINKKYLAG SVATLVLSVC AYELGLHQAQ 
ENLTPDEVSK REQINAEQIV IKITDQGYVT 
EELLMKDPNY OLKDSDIVNE IKGGYVIKVN 
EINRQKQEHS QHREGGTSAN DGAVAFARSQ 
GDAYIVPHGD HYHYIPKNEL SASELAAAEA 
NTPRTNWVPS VSNPGTTNTN TSNNSNTNSQ 
QRHVESDGLI FDPAQITSRT ARGVAVPHGN 
IPLRYRSNHW VPDSRPEEPS PQPTPEPSPS 
KEAVRKVGDG YVFEENGVSR YIPAKNLSAE 
AKKTDLPSSD REFYNKAYDL LARIHQDLLD 
VSSDKVKLVD DILAFLAPIR HPERLGKPNA 
EDGYIFDPRD ITSDEGDAYV TPHMTHSHWI 
GLTPPSTDHQ DSGNTEAKGA EAIYNRVKAA 
GSLIIPHYDH YHNIKFEWFD EGLYEAPKGY 
HSDNGFGNAS DHVQRNKNGQ ADTNQTEKPS 
SEKPESPKPT EEPEEESPEE SEEPQVETEK 
KSNAKETLTG LKNNLLFGTQ DNNTIMAEAE 



TVKENNRVSY IDGKQATQKT 50 

SHGDHYHYYN GKVPYDAIIS 100 

GKYYVYLKDA AHADNVRTKE 150 

GRYTTDDGYI FNASDIIEDT 200 

FLSGRENLSN LRTYRRQNSD 250 

ASQSNDIDSL LKQLYKLPLS 300 

HYHFIPYEQM SELEKRIARI 350 

PQPAPNPQPA PSNPIDEKLV 4 00 

TAAGIDSKLA KQESLSHKLG 450 

NKGRQVDFEA LDNLLERLKD 500 

QITYTDDEIQ VAKLAGKYTT 550 

KKDSLSEAER AAAQAYAKEK 600 

KKVPLDRMPY NLQYTVEVKN 650 

TLEDLLATVK YYVEHPNERP 700 

EEKPQTEKPE EETPREEKPQ 750 

VEEKLREAED LLGKIQDPII 800 
KLLALLKESK (SEQ ID NO: 4) 840 

FIGURE 4 



ATGGAGAATA TAGACATGTT TAAATCAAAT CATGAGCGAA GAATGCGTTA TTCCATTCGT 
AAATTTAGTG TAGGAGTAGC TAGCGTAGCT GTTGCCAGTC TTTTTATGGG AAGTGTTGTA 
CATGCGACAG AGAAAGAGGG AAGTACCCAA GCAGCCACTT CTTTTAATAG GGGAAATGGA 
AGTCAGGCAG AACAACGTGG AGAACTCGAT TTAGAACGAG ATAAGGCAAT GAAAGCGGTC 
AGTGAATATG TAGGAAAAAT GGTGAGAGAT GCCTATGTAA AATCAGATAG AAAACGACAT 
AAAAATACTG TAGCTCTAGT TAACCAGTTG GGAAACATTA AGAACAGGTA TTTGAATGAA 
ATAGTTCATT CAACCTCAAA AAGCCAACTA CAGGAACTGA TGATGAAGAG TCAATCAGAA 
GTAGATGAAG CTGTGTCTAA ATTTGAAAAG GACTCATTTT CTTCGTCAAG TTCAGGATCC 
TCCACTAAAC CAGAAACTCC GCAGCCGGAA AATCCAGAGC ATCAAAAACC AACAACTCCA 
TCTCCGGATA CCAAACCAAG CCCTCAACCA GAAGGCAAGA AACCAAGCGT ACCAGACATT 
AATCAGGAAA AAGAAAAAGC TAAGCTTGCT GTAGTAACCT ACATGAGCAA GATTTTAGAT 
GATATACAAA AACATCATCT GCAGAAAGAA AAACATCGTC AGATTGTTGC TCTTATTAAG 
GAGCTTGATG AGCTTAAAAA GCAAGCTCTT TCTGAAATTG ATAATGTAAA TACCAAAGTA 
GAAATTGAAA ATACAGTCCA CAAGATATTT GCAGACATGG ATGCAGTTGT GACTAAATTC 
AAAAAAGGCT TAACTCAGGA CACACCAAAA GAACCAGGTA ACAAAAAACC ATCTGCTCCA 
AAACCAGGTA TGCAACCAAG TCCTCAACCA GAGGTTAAAC CGCAGCTGGA AAAACCAAAA 
CCAGAGGTTA AACCGCAACC AGAAAAACCA AAACCAGAGG TTAAACCGCA GCCGGAAAAA 
CCAAAACCAG AGGTTAAACC GCAGCCGGAA AAACCAAAAC CAGAGGTTAA ACCGCAGCCG 
GAAAAACCAA AACCAGAGGT TAAACCGCAG CCGGAAAAAC CAAAACCAGA GGTTAAACCG 
CAGCCGGAAA AACCAAAACC AGAGGTTAAA CCGCAGCCGG AAAAACCAAA ACCAGAGGTT 
AAACCGCAGC CGGAAAAACC AAAACCAGAG GTTAAACCGC AGCCGGAAAA ACCAAAACCA 
GAGGTTAAAC CGCAGCCGGA AAAACCAAAA CCAGAGGTTA AACCGCAACC AGAAAAACCA 
AAACCAGAGG TTAAACCGCA ACCAGAAAAA CCAAAACCAG ATAATAGCAA GCCACAAGCA 
GATGATAAGA AGCCATCAAC TACAAATAAT TTAAGCAAGG ACAAGCAACC TTCTAACCAA 
GCTTCAACAA ACGAAAAAGC AACAAATAAA CCGAAGAAGT CATTGCCATC AACTGGATCT 
ATTTCAAATC TAGCACTTGA AATTGCAGGT CTTCTTACCT TGGCGGGGGC AACCATTCTT 
GCTAAGAA^A GAATGAAATA G (SEQ ID NOt 5) 

FIGURE 5 



60 
120 
180 
240 
^300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
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1581 
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MENIDMFKSN HERRMRYSIR KFSVGVASVA VASLFMGSW HATEKEGSTO 
AATSFNRGNG SQAEQRGELD LERDKAMKAV SEYVGKMVRD £^r£2 
KNTVALVNQL GNIKNRYLNE IVHSTSKSQL QELMmSqSE SS 
DSFSSSSSGS STKPETPQPE NPEHQKPTTP SPDTKPSPQP 
NQEKEKAKLA WTYMSKILD DIQKHHLQKE D« 

kpSS^ eientvhkif admdawtkf kkg£^pk 5£S£ 

ISS^ 0 * EVKP 0 LEKPK PEVKPQPEKP KPEVKPQPEK PKPEVKPQPE 
KPKPEVKPQP EKPKPEVKPQ PEKPKPEVKP QPEKPKPEVK POPEkSSev 

2^25? EVKPQp ^ K phvkpopSp 

J2???SSS P? 1 ^ 5 ™ LSKDKQPSNQ ASTNEKATNK PKKSLPSTGS 500 
ISNLALEIAG LLTLAGATIL AKKRMK (SEQ ID NO: 6) 

FIGURE 6 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
526 



ACGTCACACG GTGACCACTA TCMdSS aSSS^ TTACAGATCA GGGCTATGTA 240 

AGTGAAGAAC TCTTGAtSJ SS ™™™ TTCCTTATGA TGCCCTCTTT 300 

GAAGTCAAGG GTGGTTATAT XSg^ SJSJSi ^ ACGCTGA TATTGTCAAT 360 

GCAGCTCATG CTGATAATGT SSSJSSf n^^ T ATTATGTCTA CCTQAAAGAT 420 

GTCAAAGATA SXSSSJ JIS"*** ^CAAGAACAT 480 

ACGACAAATG ATGGTTATGT CTTTaSJS SSSSSf CAAGGTCTCA GGGACGATAT 540 

TATATCGTTC CTCATGGAGG iSSJSSS * TCGAAGATAC GGGTAATGCT 600 

GAATTAGCAG SSSKES aSSgS£ ATCTGCTAGT 660 

ACACCAAATG GAGTTGCGAT JS^SSS CTGCTAA6AT TATCAGTCGT 900 

CTTTCTGCTT TAGAAGAAaI ACTTTATTCC TTACAGCAAG 960 

CATTACATTC CAAAATCAAA icISSSS TAAGACATGG TGATCATTTC 1200 

ACACCTTCTC CATCTCTTcS ISJJI^ CAACCGACTC TTCCAAACAA TAGTCTAGCA 1260 

GGATACGGAT SSSSS iSSSS ££25? EE"™ 1 "20 

CACGGAGACC ACAATCATTA tSSS A^J^* AATCAGGTTT TGTCATGAGT 1380 

CGCAAAAACA TTTAG AAGQACTTGA CAGAAGAGCA AATTAAGGTG 1440 

1455 



FIGURE 7 
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n^ SLSL CAYALNQHRS OENKDNNRVS YVDGSQSSQK 
SSIS Q^GIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 

vqlkdadivn evkggyiikv dgkyyvylkd aahadnvrtk 

?sss ssss r^ 500 ^ 

252S SSSiSf ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 
PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISP 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR M^PISGTCST VStoSJX 
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AY^S^F 
2E55E ° PTLPNNSLA "SPSLPINP GTSHEKhSS ^£55 
AEDESGFVMS HGDHNHYFFK KDLTEEQIKV RKNI (SEQ S nSTJ, 



50 
100 
150 
200 
250 
300 
350 
400 
450 
484 
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(SEQ ID HO i 9) 
FIGURE 9 



60 
120 
180 
240 
300 



TATACTGGAG AAOAATTAAC GAATGTTGTT AATTTOTra n . «"«iaaaqtt 
CAAAACTTTA CTCTAGCCAA TOQTnaAall AAAATAGTAC GTTTAATAAT 

GAGAAAAAAT TAGGTmSJ SSSJ^ SSSISP T™ 3 ™* GCCTGAATTC 360 

GAGAAAGTAT CTGGTAAAGT 2£SE£J iiSJ^ CACCA<3ATGa AAAAGTATTG 420 

GATCAACCTT ATTTACCAGG SESSJ SSSS JcS^ 252™ 48 ° 

oiiE EE ™ ™ ;™ iiE 

GACAATCAAT CGACTTATAT TGTGGAAGTA SSSSS SJSSiJJ TGCTTATTTG 840 
AAACCAAGTA TTCTACCACA ATTTiAA^ A^n^II^ AAAAAGAAAA TCAAACTGAT 900 
GAAAAGGTAG AAgScSS Sa^SS AAGAAAACTC AAAAC1TGAT 960 

GGGAATAGTA CTAGTAATTC SSSSa iiS2E5£ AAGAAAAACT TTCTGAAACT 
AAAGTAGCAA AATTTGcS A^G^SS ATPlirrTBr- CAGTGGATCC TGTACAAGAA 
GACGGAACAA TTGAATTATA iiSSSS? AG AAAATGTCTT GTTTAATATG 

tttacaggag 22ES£ SJ^lt GAT 

TCTACTGGAA CAGTTGAGAA CCaACCAApa CATCTGAAAA TGGAAAAGTA 

GCACCAAACG AAAAACCTGT SSISIJS t^™* 0 CMCAGMTC TTTACCAGAG 
GAAGGGAATG TGGGGAGTGA £££££££ ^SS^f ATAATGGAAT GTTGAATCCA 
GATCCTGTAC aSaAaSS TAGAGGAAGC TCCAGCAGTA 



540 
600 
660 
720 
780 



1020 
1080 
1140 
1200 
1260 
1320 

GATCCTGTAC AAGAAAAATT mmZ ^TUCAGCAT TAGAGGAAGC TCCAGCAGTA UTo 
GTTATATTCA I£££S£ SJJS ACGGATTAGG CTTAGATAGT JSo 

AAGAATTTAT CTGATTTCAT AGTGATAAAA 1560 
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MKDliDKKIEE KIAGIMKQYG 
HKPVGIGHSH SNYELFKPEE 
QNFTLANGQK RVSFSFPPEL 
GVGNIANFEL DQPYLPGQTF 
QTIFYPFHAG DTYLRVNPQF 
IKI.PIPKLNQ GTTRTAGNKI 
KPSILPQFKR NKAQENSKLD 
EVPTVDPVQE KVAKFAESYG 
FTGEAPQGNG ENKPSENGKV 
NSTDNGMLNP EGNVGSDPML 
VIFNMDGTIE LRLPSGEVIK 



VKRESIWNK EKNAIIYPHG 
GVAKKEGNKV YTGEELTNW 
EKKLGINMLV KLITPDGKVL 
KYTIASKDYP EVSYDGTFTV 
AVPKGTDALV RVFDEFHGNA 
PVTFMANAYL DNQSTYIVEV 
EKVEEPKTSE KVEKEKLSET 
MKLENVLFNM DGTIELYLPS 
STGTVENQPT ENKPADSLPE 
DPALEEAPAV DPVQEKLEKF 
KNLSDFIA (SEQ ID NOt 

FIGURE 10 



DHHHADPIDE 50 

NLLKNSTFNN 100 

EKVSGKVFGE 150 

PTSLAYKMAS 200 

YLENNYKVGE 250 

PILEKENQTD 300 

GNSTSNSTLE 350 

GEVIKKNMAD 400 

APNEKPVKPE 450 

TASYGLGLDS 500 

"> 528 
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BVH3 HU2 


i 


BVH3 RX1 


T 
X 


BVH3 JNR7/87 


1 


BVH3 SP64 


1 


BVH3 P4241 


1 


BVH3 A66 


1 


BVH3 WU2 


61 


BVH3 RX1 


61 


BVH3 JNR7/A7 


61 


BVH3 SP64 


61 


BVH3 P4241 


61 


BVH3 AC£ 

OVflJ ADD 


61 


BVH1 WTT9 


121 


BVH1 PV1 


121 


BVH3 JNR7/87 


121 


BVH3 SP64 


121 


BVH3 P4241 


121 


BVH3 A66 


121 


BVH3 WU2 


181 


BVH3 RX1 


181 


BVH3 JNR7/87 


181 


BVH3 SP64 


181 1 


BVH3 P4241 


181 : 


BVH3 A66 


181 : 



- - — - ♦ ~wwvwm^*wcmiua rwuvouiSJlViXUAEOIVTKTTTVYT 



YV «o 



If"™!£!!X3? GCT ^^ 120 



>KD 120 

TSHGDMYYNGKVPYDALFSEELLMKD^^ 120 



If"°!?!!)2™ 0WPTO ^ 120 

120 



aaw^nxnro 1 NRQKQEHVKDNE KVN SNVA V ARS QG R YTTNDG YVFN PAD 1 1 EDT^NA 
AAHADNVRTKD E INRQKQ EHVKDNE KVN S NVAVARS QGR YTTNDG YVFN PAD I 



180 
180 
180 
180 
180 

IEDTGNA 180 



:gstsk 



YIVPTOGI^PKSDLSASBUUUVKA^ 



240 
240 
240 
240 
240 
240 



BVH3 WU2 


241 


BVH3 RX1 


241 


BVH3 JNR7/87 


241 


BVH3 SP64 


241 


BVH3 P4241 


241 


BVH3 A66 


241 


BVH3 KU2 


301 


BVH3 RX1 


301 


BVH3 JNR7/87 


301 


BVH3 SP64 


301 


BVH3 P4241 


301 


BVH3 A66 


301 


BVH3 WU2 


361 


BVH3 RX1 


361 


BVH3 JNR7/87 


361 


BVH3 SP64 


361 


BVH3 P4241 


361 


BVH3 A66 


361 


BVH3 WU2 


421 i 


BVH3 RX1 


421 < 


BVH3 JNR7/87 


421 4 


BVH3 SP64 


421 4 


BVH3 P4241 


421 ( 


BVH3 A66 1 


421 < 



SESKSSESS^^ III 

S^f^ rospsA « RysEs ^vFDPAKii^ III 

PAmCSEMI^lXKKLVDSPSAQRYSESDGLVFDPAKIISRTPNOVAIPMGD^IPMK III 



VSZllS^^ 420 

SHEKHEED 420 

SHEKHEED 420 

SHEKHEED 420 

P^iyE^ATAYIVRHGDHFHYXPKSNQI^PrrP^^;^^;^^^ 420 



ProiVBBTATAYIVRHGDHPHVIPKSHQIGQPTLPNNSIATPSPSLPllIPOISHBKHEBD 4 5« 

PTOIVBETATAYIVlfflGOHPHYIPKSHQIGQPTLProiSIATPSPSLPIMPGTaSraKm 111 
PKDIVBBTATAYIVRHGDHPHYIPKSNQIGQPTLPNNSIATPSPSLPIMPGT^^ 



420 



^^sssssssssssssssssss^ « 

n3°^^"™!?°!^! H ^^ FF ^ LTEE 0 I ^Q^EVKTSHNaLDSLSS 48 0 



LDSLSS 480 

"^^^^"^PKHJLTEEQIKAAQKHLBBVKTSHHGLDSLSS 480 
GYGroANRriAEDESGFVMSHGDramYPFKKDLTEEQIKAAQKHLBEVKTS^GlisLis 480 
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BVH3 WU2 
BVH3 RX1 
BVH3 JMR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 AS 6 



481 HEQDYPSNAKBMKDLDKKI EEKI A<3 XMKQ YGVKRESI WNKEKNAI IY PHQDHHHADPID 
HEQDYPSNAKEMKDIiDKKIEEKIAOIMKQyoVKRESIVOTnCEjaffil 



481 



UIYPHGDHHHADPID 



540 

540 
540 
S40 
540 
540 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 AS 6 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 AS 6 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



.a =H=5s£Ss=~£g=5 a 

- ^:==^=^^S a 

^ANFELDQPYLPGQT 660 
660 



661 FKYTIASKDY 



Ml FKVTIASKDYPEVSYIX^FTVPTiZA'^^i^;^^, 



661 PKYTIASKDYPBVSYDGTPTV] 



VNPQFAVPKGTDftL 720 
; BTYtiR VKPQPA VP KGTDAL 720 



PTSIAYKMASQTI PYPFHAGDTYtJl VNPQFAVPKGTDAL 



720 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



BVH3 HU2 
BVH3 RX1 
BVH3 JHR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



HI ^^^S^ll^ll^^^^^^srr 1VS 780 

a sss^^p^^assssBsss a 

711 !SS^™™a^^SSSSSSSKSSJBKS III 



781 

781 VPIIJSKENQTDKPSILP^^ 

781 

781 
781 



EKEKLSETGNSTSMSTL 840 



840 
840 
840 

KEKL5 BTGNSTSNSTL 840 



841 BBVPTVDPVQEKVAKF^^ 

84 jL 

841 
841 



eevptvdpvSbkw^^ 900 

EBVI^PvSeS^^ 900 



900 
900 
900 

KKNMADFTC3EAPQGN 900 



III ^1^%^™™^™*™^^ 



901 GENXPSBNGW^.™™*,,™^?!^ 960 

^PVKPKNSrDNGMmPEGNVQSDPM 9SO 



««, GQI1CPSENOICVSTGTVENQPTENKPADSLPE71PNBKP 

901 GE^i^SS^ 960 



BNSTDNGMUJPEGNVGSDPM 



960 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



961 LDPAMBAPAVDPVQE^KPTaItoW^^J^^^^^ 0 ^ 11 ^" 0 "* 1019 

961 llll 
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BVHll-2 SP64 
BVHll-2 JNR7/87 
BVHll-2 P4241 
BVHll-2 A66 
BVHll-2 WU2 
BVHll-2 Rxl 
BVH11 P4241 
BVH11 WU2 
BVH11 A66 
BVH11 Rxl 
BVH11 JNR7/87 
BVH11 SP63 
BVH11 SP64 



J ^^^OAGQVKK^SNRVSYIDGDQAGQKAENLTPDBVSKREO 60 

1 CSYBIX3RHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREOINAEQIVTKITM 60 

1 CS YBU3RHOAGQDKKESNRVAYI DGOQAGQKAENLTPDBVSKREGIKAEQI VI " 

' ^ ^^^Q^Q^^^^^^^^^DQAGQKAENLTPDBVS KREGINAEQI VI KITDQGY 60 

J no v AC ^ DKKES **** V * Y 1 DQOQAGQKA ENLT PDB VS KREG I NAEQ I V I K I TDQGY 60 

1 CSYEI^RHQAGQDKKESNRVAYIDGDQAGQKAENLTPDBVSKREGIN^ l Q 

1 CS YELGRHQAGQDKKESNR VAYI DGDQAGQKAENLTPDEVS KREG I NAEQ I VI K ITDQGY 60 

I CS^rrwnirnvIfif^ SNRVAY IDGDQAGQKAENLT PDEVS KREG I NAEQ I VI K ITDQGY 60 

1 CS YEIX3RHQAGQVXKESNRVSY I DGDQAGQKAENI*TPDEVS KREG INAEQI VI KITdS 60 
\ CSYEWRHQA^^ 

^^^^^^ 59 



61 VTJ 
61 



BVHll-2 SP64 61 

BVHll-2 JNR7/87 61 
BVHll-2 P4241 
BVHll-2 A66 

BVHll-2 WU2 61 

BVHll-2 Rxl 61 

BVH11 P4241 61 

BVH11 WU2 61 

BVK11 A66 si 

BVH11 Rxl 61 

BVH11 JNR7/87 61 

BVH11 SP63 61 

BVH11 SP64 60 



VTSHGDHYHYYNGKVPYDAIISEBU^^ 

VTSHGDHYHYYKGKVPYDAI I SBBLLMKDPNYQLKDSD I VNE I KGG YVI KVNGKYYVYIJC 
™^HYHYYNGKVPTOAIISBEIJ^ 

VTSHGDHYHYYNGKVP YDA 1 1 SBBLLMKD PNYQLKDS D I VNE I KGG YVI KVN^KYTOYLK 
VTSHGDHYHYYNGKVPYDAIISEELLMKDPNYQIJCDSDIVNE^ 

VT SHGDHYHYYNGKVP YDAI I S BELLMKD PNYQLKD S D IVNBI KGG YVI KVDG KYYVYLK 
VTSHGDHYHYYNGKVP YDAI I SEBLLMKDPNYQLKDSDIVNBI 

VTSHGDHYHYYNGKVPYDAIISBEI^KDPNYQUODSDIVNElS 120 
^ * ^^i^y^^ 01 ^ 1 * 1 S B E LLMKD PNYQLKD S D I VNE I KGGYVI KVNGKYYVYLK ill 



120 
120 
120 
120 
120 
120 
120 
120 
120 
120 
120 



BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 



-2 SP64 
-2 JNR7/87 
-2 P4241 
-2 A66 
-2 WU2 
-2 Rxl 
P4241 
WU2 
A66 
Rxl 

JNR7/87 
SP63 
SP64 



^ ^Af^N^TKEEIKRQKQBHSHNHNSRA- - - DNAVAAARAQGRYTTDDGYI FNASD HE 177 
121 DAAHADNI RTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASD 1 1 E 1 1 1 
121 DAAHADNI RTKEE 1 KRQKQEHSHNHGGGSN - - ^AWAARA^R YTTDD^I FNASDI I E 178 
HI nf^^^^^^^^^^^^^^^^^^ " " ^^^^^^GR ^T^DDCTI FKA^DI I B 178 
^ ^^ A J^ DNIRT ^^^K R Q K QEHSHNHGGGSN- - DQAWAARAQGRYTTDDGYI FNASDIIB 178 

\l\ n^tr^ NIRT ^ E ^ ^^^^ R ^^ I ^ S ^ A DNAVAAARA^RYTTDDGYI FNASDI I E 111 

"J DAAHADNIRTKEBIKRQKQEHSHNHGGGSN- -DQAVVAARaSrY^^iS^E ^78 
121 DAAHADNI RTKEE I KRQKQEHSHNHGGGSN - -DQAVVAARA^RYT ™ 
^^NIRTKEEIKRQKQEHSHNHGGGSN- ^VVAAR^RY^^F^D^IE "2 
121 DAAHADNI RTKEE I KRQKQERSHNHNSRA - - - DNA VAAARA^RYTTDDGYI FNASD 1 1 E 177 
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^™° T0 TCAAACaTGA AAOTATTQTC GTGAATAAAG AAAAAAATGC GATTATTTAT 3 4,n 
CCGCATGGAG ATCACCATCA TGCAOATCCG ATTGATGAAC ATAAACCGGT TOGAAtS^ , 
S^A^JS GTAACTATGA ACTGTTTAAA cSg^S 1S5SEESZ 2 

££££££ JJaSSaS A ° AATTAACa ^QTTGTTA ATTTGTTAAA 0 
TTTAATAATC AAAACTTTAC TCTAGCCAAT GGTCAAAAAC nrmrrrrrT Tar.w™*~ 

££££ «?«»» ™ 

GATTATCCAQ AAGTAAQTTA TQATGQTACA TTTACAGTT? SSSi^ ~" 

====== 

GGAAATGCTT ATTTAGAAAA TAACTATAAA GTTGGTGAAA TCAAATTACC GATtStaaa 

SSSSS S2SS! ss ss jssss 

====== s 

a^=? SSSSK SSSSS SSSSS ™Es 

GGAAAAGTAT CTACTGGAAC AGTTG Art r IZ , lo AAAATAAACC ATCTGAAAAT 4560 
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CAGAGATCTT AQTGAATCAA 
AATATCTAQC TGGGTCAGTA 
TGCATCAAGC TCAAACTGTA 
CGACGCAAAA AACGGAGAAT 
CCGAACAAAT CGTCATCAAG 
ATCATTACTA TAATGGCAAG 
AAGATCCGAA TTATCAGTTO 
TCATTAAGGT AAACGGTAAA 
TCCGTACAAA AGAAGAAATC 
GGACTTCAGC AAACGATGQT 
ATGATGGTTA TATCTTCAAT 
TTCCTCATGG AGATCATTAC 
CTGCTGCAGA AGCCTTCCTA 
GACAAAATAG CGATAACACT 
CTAGAAATAC TAACACAAGC 
ACATTGATAG TCTCTTGAAA 
CTGATGGCCT TATTTTCGAC 
TCCCTCATGG TAACCATTAC 
GAATTGCTCG TATTATTCCC 
CAGAAGAACC AAGTCCACAA 
ATCCTCAACC AGCTCCAAGC 
AAQTAGGCGA TGGTTATGTC 
ATCTTTCAGC AGAAACAGCA 
CTCATAAGCT AGGAGCTAAG 
AGGC TTAT GA CTTACTAGCA 
TTGATTTTGA GGCTTTGGAT 
TCAAGTTAGT GGATGATATT 
GAAAACCAAA TGCGCAAATT 
GCAAGTACAC AACAGAAGAC 
GGGATGCCTA TGTAACTCCA 
CTGAAGCTGA GAGAGCGGCA 
CGACAGACCA TCAGGATTCA 
GCGTGAAAGC AGCTAAGAAO 
TAGAAGTCAA AAACGGTAGT 
TTGAGTGGTT TGACGAAGGC 
TGGCGACTGT CAAGTACTAT 
TTGGTAACGC TAGCGACCAT 
CGGAAAAACC AAGCGAGGAG 
AAGAGAAACC ACAAAGCGAO 
AATCACCAGA GGAATCAGAA 
GAGAGGCTGA AGATTTACTT 
AOACTCTCAC AGGATTAAAA 
TGGCAGAAGC TGAAAAACTA 
TTCTAACTCC TAAAAACAGG 
GTTCTAG (SED ID NO 



ATATACTTAA 
GCTACACTTG 
AAAGAAAATA 
TTGACTCCTG 
ATTACGGATC 
GTCCCTTATG 
AAGGATTCAG 
TACTATGTTT 
AATCGGCAAA 
GCGGTAGCCT 
GCATCTGATA 
CATTACATTC 
TCTGGTCGGG 
CCAAGAACAA 
AACAACAGCA 
CAGCTCTACA 
CCAGCGCAAA 
CACTTTATCC 
CTTCGTTATC 
CCOACTCCAG 
AATCCAATTG 
TTTGAGGAGA 
GCAGGCATTG 
AAAACTGACC 
AGAATTCACC 
AACCTGTTGG 
CTTGCCTTCT 
ACCTACACTG 
GGTTATATCT 
CATATGACCC 
GCCCAGGCTT 
GGAAATACTG 
GTGCCACTTG 
TTAATCATAC 
CTTTATGAGG 
GTCGAACATC 
GTTCAAAGAA 
AAACCTCAGA 
AAACCAGAGT 
GAACCTCAGG 
GGAAAAATCC 
AATAATTTAC 
TTGGCTTTAT 
ATAGGAGAAC 
12) 



GAAAAGAGGA 
TTTTAAGTGT 
ATCGTGTTTC 
ATGAGGTTAG 
AAGGTTATGT 
ATGCCATCAT 
ACATTGTCAA 
ACCTTAAGGA 
AACAAGAACA 
TTGCACGTTC 
TCATCGAAGA 
CTAAGAATGA 
AAAATCTGTC 
ACTGGGTACC 
ACACTAACAG 
AACTGCCTTT 
TCACAAGTCG 
CTTATGAACA 
GTTCAAACCA 
AACCTAGTCC 
ATGAGAAATT 
ATGGAGTTTC 
ATAGCAAACT 
TCCCATCTAG 
AAGATTTACT 
AACGACTCAA 
TAGCTCCGAT 
ATGATGAGAT 
TTGATCCTCG 
ATAGCCACTG 
ATGCTAAAGA 
AGGCAAAAGG 
ATCGTATGCC 
CTCATTATGA 
CACCTAAGGG 
CAAACGAACG 
ACAAAAATGG 
CAGAAAAACC 
CTCCAAAACC 
TCGAGACTGA 
AGGATCCAAT 
TATTTGGCAC 
TAAAGGAGAG 
GGGAAAACGA 



AAGAATGAAA 
CTGTGCTTAT 
CTATATAGAT 
CAAGCGTGAA 
GACCTCTCAT 
CAGTGAAGAG 
TGAAATCAAG 
TGCAGCTCAT 
TAGTCAGCAT 
ACAGGGACGC 
TACGGGCGAT 
GTTATCAGCT 
AAATTTAAGA 
TTCTGTAAGC 
TCAAGCAAGT 
GAGTCAACGC 
AACCGCCAGA 
AATGTCTGAA 
TTGGGTACCA 
AAOTCCGCAA 
GGTCAAAGAA 
TCGTTATATC 
GGCCAAGCAG 
TGATCGAGAA 
TGATAATAAA 
GGATGTCTCA 
TCGTCATCCA 
TCAAGTAGCC 
TGATATAACC 
GATTAAAAAA 
GAAAGGTTTG 
AGCAGAAGCT 
TTACAATCTT 
CCATTACCAT 
GTATACTCTT 
TCCGCATTCA 
TCAAGCTGAT 
TGAGGAAGAA 
AACAGAGGAA 
AAAGGTTGAA 
TATCAAGTCC 
CCAGGACAAC 
TAAGTAAAGG 
AAAATGAGAG 



ATCAATAAAA 
GAACTAGGTT 
GGAAAACAAG 
GGAATCAACG 
GGAGACCATT 
CTCCTCATGA 
GGTGGTTATG 
GCGGATAATG 
CGTGAAGGAG 
TACACCACAG 
GCCTATATCG 
AGCGAGTTGG 
ACCTATCGCC 
AATCCAGGAA 
CAAAGTAATG 
CATGTAGAAT 
GGTGTAGCTG 
TTGGAAAAAC 
GATTCAAaAC 
CCTGCACCAA 
GCTGTTCGAA 
CCAGCCAAGA 
GAAAGTTTAT 
TTTTACAATA 
GGTCGACAAG 
AGTGATAAAG 
GAACGTTTAG 
AAGTTGGCAG 
AGTGATGAGG 
GATAGTTTGT 
ACCCCTCCTT 
ATCTACAACC 
CAATATACTG 
AACATCAAAT 
GAGGATCTTT 
GATAATGGTT 
ACCAATCAAA 
ACCCCTCGAO 
CCAGAAGAAG 
GAAAAACTGA 
AATGCCAAAG 
AATACTATTA 
TAGCAGCATT 
CAGAATGTGA 
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GGQTCTTAAA 
AAATCTGOAA 
TCAATAAAAA 
AACTTGOTCQ 
ATGGTGATCA 
AGGGGATCAA 
ATGGAGACCA 
AACTTCTCAT 
AOGGTGGCTA 
ATGCGGACAA 
ATCATAACTC 
CGGATGATGG 
TCGTTCCTCA 
TAGCTGCTGC 
GTTATAATGC 
CAACTTATCA 
AACCCTTATC 
CAAGTCGAAC 
ATGAACAAAT 
CAAACCATTG 
CTAGTCCAAO 
AGAAATTGGT 
GAGTTTCTCG 
GCAAACTGGC 
CATCTAGTGA 
ATTTACTTGA 
GACTCAAGGA 
CTCCGATTCG 
ATGAGATTCA 
ATCCTCGTGA 
GCCACTGGAT 
CTAAAGAGAA 
CAAAAGGAGC 
GTATGCCTTA 
ATTATGACCA 
CTAAGGGGTA 
ACGAACGTCC 
AGGCAGACCA 
CTCACCCTGA 
TTTATAAACC 
AGGCTGAAAT 
CCTTGCTAGA 
GTCTAAAAAG 
ATAGTCTCTT 
(SEQ ID NO 



ACTCTGAATC 
GAAAATATGA 
ATATCTAGCA 
TCACCAAGCT 
GGCTGGTCAA 
CGCCGAACAA 
TTATCATTAC 
GAAAGATCCG 
TGTGATTAAG 
TATTCGGACA 
AAOAGCAGAT 
GTATATCTTC 
CGGCGACCAT 
AGAAGCCTAT 
AAATCCAGTT 
TCAAAATCAA 
AGAACGCCAT 
CGCCAGAGGT 
GTCTGAATTG 
GGTACCAGAT 
TCTGCAACCT 
CAAAGAAGCT 
TTATATCCCA 
CAAGCAGGAA 
TCGAGAATTT 
TAATAAAGGT 
TGTCTCAAGT 
TCATCCAGAA 
AGTAGCCAAG 
TATAACCAGT 
TAAAAAAGAT 
AGGTTTGACC 
AGAAGCTATC 
CAATCTTCAA 
TTACCATAAC 
TAGTCTTGAG 
GCATTCAGAT 
AGATAGTAAA 
ATCTGATGAA 
AAGCACTGAT 
TCCTCAAGTA 
AAAAGTAACA 
TAGTCTTCTT 
GGCTTTGTTA 
: 13) 



CTTTAGAGGC 
GTGTTCTAGC 
GGTTCAGTGG 
GGTCAGGTTA 
AAGGCAGAAA 
ATTGTTATCA 
TATAATGGCA 
AATTATCAGT 
GTAGACGGAA 
AAAGAAGAGA 
AATGCTGTTG 
AATGCATCTG 
TACCATTACA 
TGGAATGGGA 
CAACCAAGAT 
GGGGAAAACA 
GTAGAATCTG 
GTAGCTGTCC 
GAAAAACGAA 
TCAAGACCAG 
GCACCAAATC 
GTTCGAAAAG 
GCCAAGGATC 
AGTTTATCTC 
TACAATAAGG 
CGACAAGTTG 
GATAAAGTCA 
CGTTTAGGAA 
TTGGCAGGCA 
GATGAGGGGG 
AGTTTGTCTG 
CCTCCTTCGA 
TACAACCGCG 
TATACTGTAG 
ATCAAATTTG 
GATCTTTTGG 
AATGGTTTTG 
CCTGATGAAG 
AAAGAGAATC 
ACGGAAGAGA 
GAGAATTCTG 
GATCCTAGTA 
CTCGGAACGA 
AAAGAAAGTC 



AGACCCACAA 
AGAAGAATTA 
CAGTCCTTGC 
AGAAAGAGTC 
ATTTGACACC 
AGATTACGGA 
AGGTTCCTTA 
TGAAGGATTC 
AATACTATGT 
TTAAACGTCA 
CTGCAGCCAG 
ATATCATTGA 
TTCCTAAGAA 
AGCAOGGATC 
TGTCAGAGAA 
TTTCAAGCCT 
ATGGCCTTAT 
CTCATGGTAA 
TTGCTCGTAT 
AACAACCAAQ 
CTCAACCAGC 
TAGGCGATGG 
TTTCAGCAGA 
ATAAGCTAGG 
CTTATGACTT 
ATTTTGAGGT 
AGTTAGTGGA 
AACCAAATGC 
AGTACACAAC 
ATGCCTATGT 
AAGCTGAGAG 
CAGACCACCA 
TGAAAGCAGC 
AAGTCAAAAA 
AGTGGTTTGA 
CGACTGTCAA 
GTAACGCTAG 
ATAAGGAACA 
ACGCTGGTTT 
CAGAGGAAGA 
TTATTAACGC 
TTAGACAAAA 
AAGATAATAA 
AACCGGCTCC 



AATGACAAGA 
AAGTGAGGAA 
CCTAAGTGTT 
TAATCGAGTT 
AGATGAAGTC 
TCAAGGTTAT 
TGATGCCATC 
AGACATTGTC 
TTACCTTAAA 
GAAGCAGGAA 
AGCCCAAGGA 
GGACACGGGT 
TGAGTTATCA 
TCGTCCTTCT 
CCACAATCTG 
TTTACGTGAA 
TTTCGACCCA 
CCATTACCAC 
TATTCCCCTT 
TCCACAATCG 
TCCAAGCAAT 
TTATGTCTTT 
AACAGCAGCA 
AGCTAAGAAA 
ACTAGCAAGA 
TTTGGATAAC 
TGATATTCTT 
GCAAATTACC 
AGAAGACGGT 
AACTCCACAT 
AGCGGCAGCC 
GGATTCAGGA 
TAAGAAGGTG 
CGGTAGTTTA 
CGAAGGCCTT 
GTACTATGTC 
TGACCATGTT 
TGATGAAGTA 
AAATCCTTCA 
AGCTGAAGAT 
TAAGATAGCA 
TGCTATGGAG 
CACTATTTCA 
TATACAGTAG 



CCTATTTAGA 
AGAATGAAAA 
TGTTCCTATG 
TCTTATATAG 
AGTAAGAGAG 
GTGACCTCTC 
ATCAGTGAAG 
AATGAAATCA 
GATGCGGCCC 
CACAGTCATA 
CGTTATACAA 
GATGCTTATA 
GCTAGCGAGT 
TCAAGTTCTA 
ACTGTCACTC 
TTGTATGCTA 
GCGCAAATCA 
TTTATCCCTT 
CGTTATCGTT 
ACTCCGGAAC 
CCAATTGATG 
GAGGAGAATG 
GGCATTGATA 
ACTGACCTCC 
ATTCACCAAG 
CTGTTGGAAC 
GCCTTCTTAG 
TACACTGATG 
TATATCTTTG 
ATGACCCATA 
CAGGCTTATG 
AATACTGAGG 
CCACTTGATC 
ATCATACCTC 
TATGAGGCAC 
GAACATCCAA 
COTAAAAATA 
AGTGAGCCAA 
GCAGATAATC 
ACCACAGATG 
GATGCGGAGG 
ACATTGACTG 
GCAGAAGTAG 
TAAAATGAA 
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MKINKKYLAG SVAVLALSVC SYELGRHQAG QVKKESNRVS YIDannArnir 
AENLTPDEVS KREQINAEQI VIKITDQGYV TSHqSSy SSSSSX? 
SEELLMKDPN YQLKDSDIVN EIKGGYVIKV SSJ5£ SEES 

S2SE5 SSKS 22122 SSSSK SESSS 

QITSRTAROV AVPHGNHYHF IPYEQMSELE ££££££ 
SgJsrSpI SIX™ PNP Q PAPS ^ IDEKLVKEAV £££££1 

SS2f?J« KDIjSAETAAO idsklakqes lshklgakkt dlpssdrefy 
NKAYDUARI HQDLLDNKGR QVDFBVLDHI. LBRLKDVSSD KVKLVdSS 
FLAPIRHPER LGKPNAQITY TDDEIQVAiCb AGKYTTEDGY £££££££ 
EGOAYVTPHM THSHWIKKDS LSEAERAAAQ AYAKEKGlS JSdSJS 
TEAKGAEAIY NRVKAAKKVP LDRMPYNIiQY TVEVKNGSLI xSESlSSZ 

^ewpdegly eapkgysled llatvkyy^e hpnerpSdn gfgSdSS 

KWKADQDSKP DEDKEHDEVS EPTHPESDEK ENHAGLMPSA dSS 
EBTEEEAEDT TDEAEIPQVE NSVINAKIAD AEAI»LEKVTD PSlSS 

SPSS ™ ISA WDBttM « ESQPAPI ^ 

FIGURE 17 
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TGTGCCTATO 
TAT0T6GAT0 
CAQAAAGAAQ 
ACGTCACACO 
AQTGAAGAAC 
GAAGTCAAGG 
GCAGCTCATG 
GTCAAAGATA 
ACGACAAATG 
TATATCGTTC 
GAATTAGCAG 
TATTCTTCAA 
GAAGAAGATG 
GTCATGAGTC 
ATTAAGGCTG 
TTGTCATCTC 
AAAATCGAAG 
GTCGTGAATA 
CCGATTGATG 
AAACCCGAAG 
ACOAATGTTG 
AATGGTCAAA 
AATATGCTAG 
GTATTTGGAG 
GGACAAACAT 
ACATTTACAG 
TTCCATGCAQ 
GATGCTTTAO 
AAAGTTGGTG 
GGAAATAAAA 
ATTGTGGAAG 
CAATTTAAAA 
AAGACTAGTG 
TCAACGTTAG 
GAAAGTTATG 
TATTTACCAT 
CAAGGAAATG 
AACCAACCAA 
GTAAAACCAG 
GACCCTATGT 
TTAGAAAAAT 
GGAACGATTG 
TAGCGTAA 



CACTAAACCA 
GCAGCCAGTC 
GAATTCAGGC 
GTGATCACTA 
TCTTGATGAA 
GTGGTTATAT 
CTGATAATGT 
ATGAGAAGGT 
ATGGTTATGT 
CTCATGGAGG 
CAGCTAAAGC 
CACCTTCTCC 
GATACGGATT 
ACGGAGACCA 
CGCAAAAACA 
ATGAACAGGA 
AAAAAATTGC 
AAGAAAAAAA 
AACATAAACC 
AAGGAGTTGC 
TTAATTTGTT 
AACGCGTTTC 
TAAAATTAAT 
AAGGAGTAGG 
TTAAGTATAC 
TTCCAACCTC 
GGGATACTTA 
TCAGAGTGTT 
AAATCAAATT 
TTCCTGTAAC 
TACCTATCTT 
GGAATAAAGC 
AGAAGGTAGA 
AAGAAGTTCC 
GGATGAAGCT 
CGGGAGAAGT 
GTGAAAATAA 
CAGAAAATAA 
AAAACTCAAC 
TAGATTCAGC 
TTACAGCTAG 
AATTAAGATT 
(SEQ 10 NO : 15) 



GCATCGTTCG 
AAGTCAGAAA 
TGAGCAAATT 
TCATTACTAT 
GGATCCAAAC 
CATCAAGGTC 
TCGAACTAAA 

taactctaat 
ctttaatcca 
tcactatcac 
acatctggct 
atctcttcca 
tgatgctaat 
cAatcattat 
tttagaggaa 
ttatccaagt 
tggcattatg 

TGCGATTATT 
GGTTGGAATT 
TAAAAAAOAA 
AAAAAATAGT 
TTTTAGTTTT 
AACACCAOAT 
GAATATTGCA 
TATCGCTTCA 
TTTAGCTTAC 
TTTAAGAGTG 
TGATGAATTT 
ACCGATTCCG 
CTTCATGGCA 
GGAAAAAGAA 
ACAAGAAAAC 
AAAAGAAAAA 
TACAGTGGAT 
AGAAAATGTC 
CATTAAAAAG 
ACCATCTGAA 
ACCAGCAGAT 
GGATAATGGA 
ATTAGAGGAA 
TTACGGATTA 
GCCAAGTGGA 



CAGGAAAATA 
AGTGAAAACT 
GTAATCAAAA 
AATGGGAAAG 
TATCAACTTA 
GATGGAAAAT 
GATGAAATCA 
GTTGCTGTAG 
GCTGATATTA 
TACATTCCCA 
GGAAAAAATA 
ATCAATCCAG 
CGTATTATCG 
TTCTTCAAGA 
GTTAAAACTA 
AATGCCAAAG 
AAACAATATG 
TATCCGCATG 
GGTCATTCTC 
GGGAATAAAG 
ACGTTTAATA 
CCGCCTGAAT 
GGAAAAGTAT 
AACTTTGAAT 
AAAGATTATC 
AAAATGGCCA 
AACCCTCAAT 
CATGGAAATG 
AAATTAAACC 
AATGCTTATT 
AATCAAACTG 
TCAAAACTTG 
CTTTCTGAAA 
CCTGTACAAG 
TTGTTTAATA 
AATATGGCAG 
AATGGAAAAG 
TCTTTACCAG 
ATGTTGAATC 
GCTCCAGCAG 
GGCTTAGATA 
GAAGTGATAA 



AGGACAATAA 
TGACACCAGA 
TTACAGATCA 
TTCCTTATGA 
AAGACGCTGA 
ATTATGTCTA 
ATCGTCAAAA 
CAAGGTCTCA 
TCGAAGATAC 
AAAGCGATTT 
TGCAACCGAG 
GAACTTCACA 
CTGAAGATGA 
AGGACTTGAC 
GTCATAATGG 
AAATGAAAGA 
GTGTCAAACG 
GAGATCACCA 
ACAGTAACTA 
TTTATACTGG 
ATCAAAACTT 
TGGAGAAAAA 
TGGAGAAAGT 
TAGATCAACC 
CAGAAGTAAG 
GTCAAACGAT 
TTGCAGTGCC 
CTTATTTAGA 
AAGGAACAAC 
TGGACAATCA 
ATAAACCAAG 
ATGAAAAGGT 
CTGGGAATAG 
AAAAAGTAGC 
TGGACGGAAC 
ATTTTACAGG 
TATCTACTGG 
AGGCACCAAA 
CAGAAGGGAA 
TAGATCCTGT 
GTGTTATATT 
AAAAGAATTT 



TCGTGTCTCT 
CCAGGTTAGC 
GGGCTATGTA 
TGCCCTCTTT 
TATTGTCAAT 
CCTGAAAGAT 
ACAAGAACAT 
GGGACGATAT 
GGGTAATGCT 
ATCTGCTAGT 
TCAGTTAAGC 
TGAGAAACAT 
ATCAGGTTTT 
AGAAGAGCAA 
ATTAGATTCT 
TTTAGATAAA 
TGAAAGTATT 
TCATGCAGAT 
TGAACTGTTT 
AGAAGAATTA 
TACTCTAGCC 
ATTAGGTATC 
ATCTGGTAAA 
TTATTTACCA 
TTATGATGGT 
TTTCTATCCT 
TAAAGGAACT 
AAATAACTAT 
CAGAACGGCC 
ATCGACTTAT 
TATTCTACCA 
AGAAGAACCA 
TACTAGTAAT 
AAAATTTGCT 
AATTGAATTA 
AGAAGCACCT 
AACAGTTGAG 
CGAAAAACCT 
TGTGGGGAGT 
ACAAGAAAAA 
CAATATGGAT 
ATTGATCTCA 
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zsss sssss 2SKK = - - 

ELAAAKAHUA OKNMQPSQLS iSSSJS E"™ *"KSM*AS 200 

RIIAEDESGF VHWWwS ££££££ SSSSSfS EED03f6FDAN MO 

LSSHEQDYPS NAKEMKDLDK KI EEKI Afl T m EE WSHiKSLDS 3 00 

yPHGDHHHAD PIDEHKPVGI SSSSf! S™ VKRSS1 VVN ™I 350 

TNWNLLKNS TFNNQNFTIiA S£2SJ5£ Jfff™™ GN1 <VYTO E EL 4" 

GKVLEKVSGK VFGEGVGMlJ SSSSvfS on™!-**" 31 NM ^KLITPD 450 

TFTVPTSLAY KMASQTIFYP ' FHAGDTYLR V S™^"* 8 ^"EVSYDO 500 

HGNAYLENNY KVGEIKLPIp SJSJSEf " PQFAVPKGT DALVRVFDEF S5 0 

IVEVPILEKE RoSSS SK*™* NA ™>*°STY 6 oo 

LSETGNSTSN STLEE^ PV^SIa SEEiX ^KVEKEK 6S ° 0 

YLPSGEVIKK NMADFTGEAP ff^"^ ENV LFNMDGTIEL 700 

SLPEAPNEKP VKPENStSg SpEG^f 5222 ^"^^ »« 

SFJTS ?^r iFNMD 0TIE= -~ — QEK 

FIGURE 19 



ZSSSSSl SEEK ZSEXS qkeoiqaeoi 

BVKGGYHKV «wSE25 ^ADIVN xJJ 

VAVARSQQRY TTNDGYVFNP adttpS deinr QKQeh VKDNEKVNSN iso 
EXAAAKAHIA oSSSSS YIVPHGQHY H YIPKSDLSAS 2 00 

i^lyds PS mkSSSS. JS2SE S25S5 PA ™* s 

IiSAItEEKIAR MVPISGTOST vJSSSSJ JSESfSS ° HyHFIPySK 300 
SSASDOYIFM PKDIVEETAT AYlSu^ ^SSLGSLSSN PSSLTTSKEL 350 
TPSPSLPINP GTSHeS S5ESE SSeSfS? g™""* 
KDIiTEEQIKA AQKHLEEVKT SHNQLnor 00 A f° ESGFVMS HGDHNHYFFK 450 
EKIAGIMKQY OVKRESIV^ SS HEQDYPGNAK EMKDLDKKIE 500 
HSNYBLFKPE EGVAKKEGNR J^o™" ° DHHHADPID EHKPVGIQHS 550 
KRVSFSFPPE SSS Z5K2SZ NQNFTLANGQ 

LOQPYIiPGQT FKYTIASKDY PEVSySS vf™f° F ° EaVG NIANFE 650 
GDTYLRVNPQ FAVPKGTDAL SSS EXEE* «»™»f» 700 
QOTTRTAGNK IPVTFMANAY iSSSSS! AVLENMYKVG EIKLPIPKLN 
RNKAQENSKL DEKVEeJSs KKSS DKPSILPQFK 
BKVAKFAESY GMKLeS SSSfp 252" STL EEVPT ™*VQ 
GENKPSENGK vstgtvenop XSSS^ sgevikknma DFTGEAPQGN 
pegnvgsdpm lwaSEX JESKS ESSES ENSTD1 « JMU ' 

ELRLPSQEVI KKNIiSOFZA (SEQ S Jo^^f ^ SVIFNMDG ™ 1000 
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50 
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150 



^u™™™ 3 YVDOS QSSQK SENLTPDQVS QKEGIQAEQI 
SkoJS?^ S^3 HYY NGKVP ">^ SEELLMKDPN YQLKDADIVN 

SS SEES ESSE S5SSSS £222 
SUSZSSZ ssss p y =JI 2 2 5 °o° 

TPSPSLPINP GTSHRVurrn * HYIPKS NQ1Q QPTLPNNSLA 400 

ti= jssr — «• 

FIGURE 21 



S££S? 22252 SSS22 22225 YVDGSQSSQK SE1 ™ QVS 

EVKGGYIIKV DGKYYVYLKD Ia£anS£ NGKVPYDALF SEELLMKDPN YQLKDADIVN > 120 

TTNDGYVFNP ££££££ S2 ™ HEKVNSN 1M 

YSSTASDNNT QSVAKGSTSK nSSSS^ ) f ?!5 SDLSAS ELAAAK AHLA OKNMQPSQLS 240 

TPNGVAIPHG SSSSSJ 2!25£K LLKELYDSPS AQRYSESDGL VFDPAKIISR 30 0 

PSSLTTSKEL s£2aJ5J ESS^S^ MVPISOTGST VSTNAKPNEV VSSLGSLSSN l 60 

TPSPSLPINP SJS 2S22S EEE™ """■W QPTLPNNSLA «J 
AQKHLEEVKT SHNgSS 25S2F ^E^no" 0 ^ 1 ™ "»™»* 



57) 

FIGURE 22 



22»22 SS S DSWSH ^DYPGNAKE MKDLDKKIEE 
SNYELfSeE V^r" PHQ DHHHAOPID * HKPVGIGHSH 

SS3S2 EEs ™i =S5= SSSSBE 

22222 SF ™- sees ss 

22SS2 2EEE "" EKENQTD »««5£ 

KVAKFAESYO Mncnurpuu KVEKEKLSET GNSTSNSTLE EVPTVDPVQE 
ENKPSENRK"v ™ M DGTIELYI -PS GEVIKKNMAD FTGEAPQGNG 

Si ESS ssas sssB ssss 

SSSS^ vifdprdiS Sm H E22SK 

SEES 8 " PP «DHQDSG UTb£ZS£ 22222J 
SSSJJSS ^f VKN ° SL "PHYDHYHN IKFEWFDEGL YEAPKGyS 
22222 p"™"f D ^NASDHV QRNKMGQA0T NQTeSSk 

222SK -SS— —"—I 



(SEQ ID NO • SB) U1UUW1 ' TiMAEAEKLL 1050 

1057 

FIGURE 23 
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S EEs 

ssraSF °"= s= s 

FIGURE 24 



EiE = ssss = ssss 

SSS3SE SSSS = SEE 5SSSS 

LSASELAAAE AFLSORENLS NLRTYRRQNS tSSSSSS 
NTSNNSNTNS QASQSNDIDS U,KQLYk£pl SQRHVES^L SaS 

s^yPTPEPSP SPQPAPNPQP APSNPIDEKL VKEAVRKVGD GYVFFFur-vc 
RYIPAKNLSA ETAAGIDSKL AKQESLSHKIi GAKKroSS DrSSSo 

iSSE 0 " ALDNLLERLK « 

toSJE tt2S25 ? VAKLAQKYT TEDOYIFDPR DITSDEGDAY 
.IfVvl™ IKKDSLSEAE RAAAQAYAKE KQLTPPSTDH ODSCJNTRAIfr" 

£££££ SS ^™ EVK »«™5 h q ™S 

0S™S JSSSSr ™»»«* SDHVQRNMJO 



QDMNTIMAEA EKLLaIL^ K u£q ij nSTS, ° LI0, «*«» {JJ 

FIGURE 25 



QTVKENNRVS YIDGKQATQK TENLTPDBVS KREQINAEOI 

(SEQ ID NO : 61) 334 

FIGURE 26 

EH ESS === "= = »» 

dSS52?i ^ T ^ DLLAR I HQDLU3NKG RQVDFEALDN LLERLKDVSS ISO 

DKVKLVDDIL AFLAPIRHPE RLGKPNAQIT YTDDEIQVAK LAGKYTTRnn ,™ 

oic^ ITS DEQDA ™">H MTHSHWIKKD SLSEAErW otSSS Hi 

"^KAAKKV PLDRHpX SS£5£ 3*0 

S5 3= — s ssbk ssss IS 

==E ESS ass- ~ G kiqdpuksn jj. 



FIGURE 27 
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a ~"™ 55522 S23S2 JS 

ssss 22222 22222 ssS EE i 

6X3 

FIGURE 28 

DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EQDYPGNAKE MKninwrPo 

Isiilliiiil = 

ss= sszss sssS ESS ^™ ■» 

25222 SEEK ™i i™ - 

LK1PSGEVJK nwoST ?23TSfS VIFHMOOTIB 5 50 

FIGURE 29 



DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EODYPGNAKE «imr n «rT BB 
KIAGIHKQYG VKRESIWNK EKNAIIYpS SSESSi hS^SS 

£££££ ZEE""' ™ E ^™™ iSEE SSSSSl 

EXfS25S£ f^ aiNMLV KLITPDQKVL EKVSGKVFGE GVGNXAMPEL 
DQPYLPGQTP KYTIASKDYP EVSYDGTFTV PTSLAYKMAS OTIFYPFHan 
DTYLRVNPQP AVPKGTDALV RVFDEFHGNA YLENNYKVgI JSmSSSS 
OTTRTAGNKI PVTFMANAYL ONQSTYIVE (SBQ ID NO f J" ° 3^ 

FIGURE 30 
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50 
100 
ISO 
200 
250 
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EEES ZSSSSS' ^^SK LDEKVEEPKT SEKVEKEKLS 
S^E^S QEKVAI ^AES YOMKIiENVLF NMDGTIELYL l 00 

EEEl JSSSSS EEE ZEES fJn E ™ 150 

ss"j?£ iel= "J 

FIGURE 31 



ESSS J?*™* itsrtargva vphgnhyhpi 

ISsS SSS SSSS2 ^ PSPQPTP EPS *SPQPAP 100 
DSKLAKQESI. fflSSSS ESffSS NLSAETAAGI 150 

EE =E £= sss EE 

=5S = 1= ===j =B 



(SEQ ID NO : 67) 

FIGURE 32 



250 
300 
350 
400 
450 



REAEDLLGKI ODPIIKSMAK ESPEESEEPQ VETEKVEEKL 500 

LKESK (SeS ll ^^^WLKNNL LFGTQDNNTI MAEAEKLLAL 55C> 



555 



DIDSLLKQLY 
PYEQMSELEK 
NPQPAPSNPI 
D3KLAKQESL 
VDFEALONLI« 
DDEIQVAKLA 
SEAERAAAQA 
DRMPYNLQYT 
LATVKYYVEH 



KLPIiSQRHVE 

RIARIIPLRY 

DEKLVKEAVR 

SHKLGAKKTD 

ERLKDVSSOK 

GKYTTEDGYI 

YAKEKGLTPP 

VBVKNGSLII 

PNERPHSDNG 



SDGLIFDPAQ 
RSNHWVPDSR 
KVGDGYVFEE 
LPSSDREFYN 
VKLVDDILAF 
F0PRDITSDE 
STDHQDSGNT 
PHYDHYHNIK 
FGNASDHV 



ITSRTARGVA 
PEEPSPQPTP 
NGVSRYIPAK 
KAYDLLARIH 
LAPIRHPERL 
GDAYVTPHMT 
EAKGAEAIYN 
FEWFDEGLYE 
(SEQ 10 no 
FIGURE 3 



VPHGNHYHFI 
EPSPSPQPAP 
NLSAETAAGI 
QDLLDNKGRQ 
GKPNAQITYT 
HSHWIKKDSL 
RVKAAKKVPL 
APKQYTLEDIj 
: 68) 



50 
100 
150 
200 
250 
300 
350 
400 
428 



FIGURE 34 121 



ZEES ESSE SEE ST"" PQSEKPESPK 

TGZ.KNKLI.Pa TQDMNTIMAE SSfSS EDLLGKIQDP IIKSNAKETL !00 
(SEQ ID NO : 70) 132 

< FIGURE 35 
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^SUSi SSSSS ITSRTAR0VA VPHGNHYHFI 

NPQPAPSNPI SJSSSS S5SSIS ISS**" 100 

DSKLAKQESL SHKLGaSS SSf™ «*™«« 150 

VDFEALDNLL ERLKdS JSSS ^ ^ ^D^^ QDLhD ^ ^00 



71) 
FIGURE 36 



(SEQ ID NO : 72) 

FIGURE 37 



FIGURE 38 



FIGURE 39 
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50 



226 



50 



iSe^Iw SSJ vaklagkytt kocvxpdpro 

DSGNTEAKGA hSSSSSS SSSnp^v GLTPPSTDHQ ^ 

YHNIKFEWFD EGLYEAPKGY 5SJ??S£ NIiQYTVEVKN GSLIIPHYDH 1S0 

DHV (SEQ ™ K ? 21 ^ YYVEHPNERP HSDNGFGNAS 2 Q0 



203 



SO 



??™ Q J ESSE SSSSS f MmnDW SKRE0INAE ° 

NEIKGGYVIK v5S£S "f ELLMKDP NVQLKDSDIV ^o 

NAVAAARAQG RYTTDDQYIF SSSSSSI KEEIKRQKQE HSHNHNSRAD 150 

aseuvaaea? wSSs sSsyS? SSSS S?^ 8 

GBNISSLLRE LYAKPLSFRH vr?e^,Vt!f vPRLSENHNL TVTPTYHQNQ 250 

fipyeqmsbl eSiiS IS™^* aqitsrt aro vavphgnhyh 300 

APNPQPAPSN pSESX SEE" 08 W1MMLQP 35^ 

GIDSKIAKQE SLSHKLGaS SS22S SEES J?* 1 *"* 400 
RQVDFEVLDN LLERLKDVSS DKVKLVno7i H?™"** IH Q° LI ^NKQ 
YTDDEIQVAK LAQKYTTEDG Y^^o ^^ IRHPE RLGKPNAQIT 
SLSEAERAAA QAYAKeS? SJSSiJ™ DE0DAmPH MTHSHWIKKD 
PLDRMPYNLQ yWSSI NTEA ^AEAI YNRVKAAKKV 

DLLATVKYYV EHPmSSSd SISSSE IKFEWFDEGI. YEAPKGYSLE 
SEPTHPESDB KENHAGLNPS RKMKA DQDSK PDEDKEHDEV 

ENSVINAKIA DAEALLEKVT JESSE If^EAED TTDEAEIPQV 
AEVDSLIiALIj lESSST ^EQ^NO *? I ™* MLb ""WIS 8 00 



450 
500 
550 
600 
650 
700 
750 



819 



50 



S S SSSSS QITSRTARGV AVPHGNHYHF 
PNPQPAPSNP 2S2S2S SS EXE 0 " PEPSP SW3PA !00 
IDSKLAKQES LSHKLGAKKT SEES SSSJSr "0 
QVDFEVLDNL LERLKDVSSD KVKLvnnrf^ 1 ^^ONKGR 200 

TDDEIQVAKL AGKYTTRnnv ™™ DIIA ^ IA PIRHPER LGKPNAQITY 
LSEAERAAAQ o^^ 1 ™ EGDAYVT PHM THSHWIKKDS 

LDRMpX? ST? k™^" NRVKA AKKV* 

LLATVKYYVE HPNERPHSDM Rm^S KFEWFDEGLY EAPKGYSLED 
EPTHPESDEK E»£S2££ ESSES ™ KADQDSKP ^EDKEHDEVS 
NSVINAKIAD aSESSS SS ffJff EAEOT TDEABIPQVB 
EVDSLLALiLK BSQPAPIQ^ ( SEQ ID^NO • ^ j" SLLL GTKDNNTISA 



250 
300 
350 
400 
450 
500 
550 
568 
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HDE VSEPT »^SD EKENHAGLNP sadnlykpst 50 

SB S 2 * =s ssss — sH 

FIGURE 40 



GACTTGACAG 
TAAAACTAGT 
ATCCAGGTAA 
AAAATTGCTG 
CGTGAATAAA 
ATGCAGATCC 
AGTAACTATG 
GAATAAAGTT 
AAAATAGTAC 
CGCGTTTCTT 
TATGCTAGTA 
CTGGTAAAGT 
GATCAACCTT 
AGATTATCCA 
TAGCTTACAA 
GATACTTATT 
TGCTTTAGTC 
ATAACTATAA 
GGAACAACCA 
TGCTTATTTG 
AAAAAGAAAA 
AATAAAGCAC 
GACTAGTGAG 
CTAGTAATTC 
AAAGTAGCAA 
GTTTAATATG 
TTAAAAAGAA 
GAAAATAAAC 
CCAACCAACA 
AAAAACCTGT 
GAAGGGAATG 
TCCAGCAGTA 
ACGGATTAGG 
TTAAGATTGC 
AGCGAAGCTT 
AAGAACCAAG 
GCACCAAATC 
CAAAGAAGCT 
GAGTTTCTCG 
GGCATTGATA 
AGCTAAGAAA 
CTTATGApTT 
CGACAAGTTG 
TGTCTCAAGT 
CTCCGATTCG 
TACACTGATG 
AGAAGACGGT 
ATGCCTATGT 



AAGAGCAAAT 
CATAATGGAT 
TGCCAAAQAA 
GCATTATGAA 
GAAAAAAATG 
GATTGATGAA 
AACTGTTTAA 
TATA CTGGAQ 
GTTTAATAAT 
TTAGTTTTCC 
'AAATTAATAA 
ATTTGGAOAA 
ATTTACCAGG 
GAAGTAAGTT 
AATGGCCAGT 
TAAGAGTGAA 
AGAGTGTTTG 
AGTTGGTGAA 
GAACGGCCGG 
GACAATCAAT 
TCAAACTGAT 
AAGAAAACTC 
AAGGTAGAAA 
AACGTTAGAA 
AATTTGCTGA 
GACGGAACAA 
TATGGCAGAT 
CATCTGAAAA 
GAAAATAAAC 
AAAACCAGAA 
TGGGGAGTGA 
GATCCTGTAC 
CTTAGATAGT 
CAAGTGGAGA 
CGTTATCGTT 
TCCACAACCG 
CTCAACCAOC 
GTTCGAAAAG 
TTATATCCCA 
GCAAACTGGC 
ACTGACCTCC 
ACTAGCAAGA 
ATTTTGAGGC 
GATAAAGTCA 
TCATCCAGAA 
ATGAGATTCA 
TATATCTTTG 
AACTCCACAT 



TAAGGCTGCG 
TAGATTCTTT 
ATGAAAGATT 
ACAATATGGT 
CGATTATTTA 
CATAAACCGG 
ACCCGAAGAA 
AAGAATTAAC 
CAAAACTTTA 
GCCTGAATTG 
CACCAGATGG 
GGAGTAGGGA 
ACAAACATTT 
ATGATGGTAC 
CAAACGATTT 
CCCTCAATTT 
ATGAATTTCA 
ATCAAATTAC 
AAATAAAATT 
CGACTTATAT 
AAACCAAGTA 
AAAACTTGAT 
AAGAAAAACT 
GAAGTTCCTA 
AAGTTATGGG 
TTGAATTATA 
TTTACAGGAG 
TGGAAAAGTA 
CAGCAGATTC 
AACTCAACGG 
CCCTATGTTA 
AAGAAAAATT 
GTTATATTCA 
AGTGATAAAA 
CAAACCATTG 
ACTCCAGAAC 
TCCAAGCAAT 
TAGGCGATGG 
GCCAAGAATC 
CAAGCAGGAA 
CATCTAGTGA 
ATTCACCAAG 
TTTGGATAAC 
AGTTAGTGGA 
CGTTTAGGAA 
AGTAGCCAAG 
ATCCTCGTGA 
ATGACCCATA 



CAAAAACATT 
GTCATCTCAT 
TAGATAAAAA 
GTCAAACGTG 
TCCGCATGGA 
TTGGAATTGG 
GGAGTTGCTA 
GAATGTTGTT 
CTCTAGCCAA 
GAGAAAAAAT 
AAAAGTATTG 
ATATTGCAAA 
AAGTATACTA 
ATTTACAGTT 
TCTATCCTTT 
GCAGTGCCTA 
TGGAAATGCT 
CGATTCCGAA 
CCTGTAACCT 
TGTGGAAGTA 
TTCTACCACA 
GAAAAGGTAG 
TTCTGAAACT 
CAGTGGATCC 
ATGAAGCTAG 
TTTACCATCA 
AAGCACCTCA 
TCTACTGGAA 
TTTACCAGAG 
ATAATGGAAT 
GATCCAGCAT 
AGAAAAATTT 
ATATGGATGG 
AAGAATTTAT 
GGTACCAGAT 
CTAGTCCAAG 
CCAATTGATG 
TTATGTCTTT 
TTTCAGCAGA 
AGTTTATCTC 
TCGAGAATTT 
ATTTACTTGA 
CTGTTGGAAC 
TGATATTCTT 
AACCAAATGC 
TTGGCAGGCA 
TATAACCAGT 
GCCACTGGAT 



TAGAGGAAGT 
GAACAGGATT 
AATCGAAGAA 
AAAGTATTGT 
GATCACCATC 
TCATTCTCAC 
AAAAAGAAGG 
AATTTGTTAA 
TGGTCAAAAA 
TAGGTATCAA 
GAGAAAGTAT 
CTTTGAATTA 
TCGCTTCAAA 
CCAACCTCTT 
CCATGCAGGG 
AAGGAACTGA 
TATTTAGAAA 
ATTAAACCAA 
TCATGGCAAA 
CCTATCTTGG 
ATTTAAAAGG 
AAGAACCAAA 
GGGAATAGTA 
TGTACAAGAA 
AAAATGTCTT 
GGAGAAGTCA 
AGGAAATGGT 
CAGTTGAGAA 
GCACCAAACG 
GTTGAATCCA 
TAGAGGAAGC 
ACAGCTAGTT 
AACGATTGAA 
CTGATTTCAT 
TCAAGACCAG 
TCCGCAACCT 
AGAAATTGGT 
GAGGAGAATG 
AACAGCAGCA 
ATAAGCTAGG 
TACAATAAGG 
TAATAAAGGT 
GACTCAAGGA 
GCCTTCTTAG 
GCAAATTACC 
AGTACACAAC 
GATGAGGGGG 
TAAAAAAGAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1700 
1750 
1800 
1850 
1900 
1950 
2000 
2C50 
2100 
2150 
2200 
2250 
2300 
2350 
2400 
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AGTTTGTCTO 
AGQTTTGACC 
CAAAAGGAGC 
CCACTTGATC 
CGGTAGTTTA 
AGTGGTTTGA 
GATCTTTTGG 
GCATTCAGAT 
AAAATGGTCA 
CCTCAGACAG 
AAGCGAGAAA 
CACCAGAGGA 
AAACTGAGAG 
CAAGTCCAAT 
TTGGCACCCA 
GCTTTATTAA 



AAGCTGAGAG 
CCTCCTTCGA 
AGAAGCTATC 
GTATGCCTTA 
ATCATACCTC 
CGAAGGCCTT 
CGACTGTCAA 
AATGGTTTTG 
AGCTGATACC 
AAAAACCTGA 
CCAGAGTCTC 
ATCAGAAGAA 
AGGCTGAAGA 
GCCAAAGAOA 
GGACAACAAT 
AGGAGAGTAA 



AGCGGCAGCC CAGGCTTATO 
CAGACCATCA GGATTCAGGA 
TACAACCOCG TGAAAGCAGC 
CAATCTTCAA TATACTGTAG 
ATTATGACCA TTACCATAAC 
TATGAGGCAC CTAAGGGGTA 
GTACTATGTC GAACATCCAA 
GTAACGCTAG CGACCATGTT 
AATCAAACGG AAAAACCAAG 
GGAAGAAACC CCTCGAGAAG 
CAAAACCAAC AGAGGAACCA 
CCTCAGGTCG AGACTGAAAA 
TTTACTTGGA AAAATCCAGG 
CTCTCACAGG ATTAAAAAAT 
ACTATTATGG CAGAAGCTGA 
G (SEQ ID NO : 76) 



CTAAAGAGAA 
AATACTGAGG 
TAAGAAGGTG 
AAGTCAAAAA 
ATCAAATTTG 
TACTCTTGAG 
ACGAACGTCC 
CAAAGAAACA 
CGAGGAGAAA 
AGAAACCACA 
GAAGAAGAAT 
GGTTGAAGAA 
ATCCAATTAT 
AATTTACTAT 
AAAACTATTG 



2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 

3050 

3100 

3150 

3171 



FIGURE 41 



SEES SSSSS SSSS f 8 ™™ QNQGENISSL 

SELEKRIARI SXSXS 5255X1 ^° VAVPHGN HYHFIPYEQM 100 

KQESLSHKLG AKKTDLPSSD REPYMiravnr ?f!?f CDL8AB ^AAGIDSKLA 200 

IjDnllerlkd vssdkS SSS5SS «?^ FEV 

VAKLAGKYTT EDGYIFDPRD ITWrr^mv!! HPERLQKPNA QITYTDDBIQ 300 

AAAQAYAKEK GLTPPsS £££££££ ^PHMTHSHWI KKDSLSEAER 350 

NLQYTVEVKN GSLlSS SSSfS EAIYNRVKAA KKVPLDRMP* 400 

wvehpkerp ESSES SF^XS STFS*"*™*™ 450 



(SEQ ID MO : 77) "3 
FIGURE 42 
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"ssss? ssssx? sr 0 * «»«««. 

»«»•«» sssss SSSS JSST 

X£S SS5SK s Si -~ 

£55! ESSE i= PSSLTTSKBL 

TPSPSLPINP GTSHEKHEPn „S!™ HYIPKSNQIG QPTLPNNSLA 
KDLTEEQIKA ABDES<3 ^ HQDHNHYFFK 

bkJSSSS 22SS2 JS"**" emkdld ^* 

HSNYEIiFKPE EGVAKKEGNK I " GD HHHADPID EHKPVGIGHS 

KRVSFSFPPE SEEKS XSX™ *W™»00 600 

LDQPYLPGQT FKYTIASKDY pSc^™ LEKVSGKVFG EGVGNIANFE 650 
GDTYLRVNPQ 22 iSSSSSS A^v^ SQTIFYPFflA 
QGTTRTAGNK IPVTFMANAY SSSS ^^NO 1 ^^ Z!° 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 



(SEQ ID NO : 78) 780 
FIGURE 43 



50 
100 
156 
200 
250 



?S 1 SyS SI? K ^TPDBVS KREGINAEQI 
EIKGGYVIKV NGKYYVYLKn S ™ 1 SEELLMKDPN YQLKDSDIVN 
NDGA^aTaS ££££££ ^™ ^ NRQKQEH SQHREGGTSA 
LSASELAAAE AFLsSSS JSf 0 TGDAYIVPHG DHYHYIPKNE 
NTSNNSNTOS qSqSmdI SSSSS T^™™* SVSNPGTTNT 
TARGVAVPHG S255SS ^QRHVBSDGL IFDPAQITSR 300 

SPQPTPBPSP BwSESS JSv^f" W * DS ™** 350 

RYIPAKNLSA ETAAGIDSKL £5£2J£ 2SKf^ GyVFEEN °VS 400 
LliARIHQDLIi DNKGROVDFR ^mnrfnP' GAKKTDI ' PS S DREFYNKAYD 450 
RHPERLGKPN AQTrSKf SSSfES DVSSDKV ^V DDIUAFLAPI 500 
VTPHMTHSHW JSSSSS ££££££ ZX2K DITSDEGDAY 
AEAIYNRVKA AKKVPLDRMP YNLQYTVEVK 2K??™2S£ QDSGNTEAKO 600 

em rsr^ ™™ ssss — ^ »• 

FIGURE 44 
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GTGAAGAAAA 
GOAAQTTACC 
TATATTQATO 
ATCAGTGCTG 
TATGTGACCT 
ATTATTAGTG 
ATCAATGAAA 
AAGCCAGGTA 
AAAGGAACTA 
GAAGTTGCGG 
ATTTTTAGTC 
AATCACTATC 
GCCTACTGGA 
GCCCCAGGTC 
CAGCCAGATA 
AAACACCAAA 
CGTCTTGATT 
GTGATCAAAT 
CGAAGAAGTC 
ACTGAGGACA 
ACCTTTCTTG 
ACGAGTGATG 
ACAGCTAAAC 
TTGGATGAGG 
TTGOATCAGG 
GTAACAAAAG 
GCTCGTGATC 
GATAAGAAGC 
GATGTGTCAA 
GTTATCCCAC 
ATTGCAACAG 
CCAGGGCATG 
GGTATGCCAA 
GGTCGTTTTG 
ACTTTTGTAT 
ACCATTAATA 
AAGAAAAATA 
AAGAGCAATG 
TTTATAGACA 
CAATTAGCAC 
CAATTTTATA 
AACCCTTAA 



CATATGGTTA TATCGGCTCA 
AACTTQGTAA GCATCATATG 
ACAGCAAAGG TAAGGCAAAA 
AAGAAGGCAT CTCTGCTGAA 
CACACGGTGA CCATTATCAT 
AAGAGTTGTT GATGACGGAT 
TCTTAGACGG TTACGTTATT 
GTAAGCGCAA AAACATTCGA 
AAGAAGCTAA AGAAAAAGGT 
CAGTCAATGA AGCAAAAAGA 
CGACAOATAT CATTGATGAT 
ATTATATTCC TAAAAAGGAT 
GTCAAAAACA AGGTCGAGGT 
GTAGGAAAGC CCCAATTCCT 
ACGGTGGCTA TCATCCAGCG 
GAGATGAGTT TAAAGGAAAA 
TGAAATACCG TCATGTGGAA 
CAAACGCTTT TGGGTATGTG 
AGTTATCACC TCTTGAAATO 
ATGACTCAGG TTCAGAGCAC 
GTCATCGCAT CAAAGCTTAC 
CTTATGTTTT TAGTAAAGAA 
ACGGAGATCA TTTCCACTAT 
TCGCTAACTG GGTGAAAGCA 
AACAAGGCAA AGAAAAACCA 
ATGGTAAAGT GGGCTATATG 
AACTTGATTT GACTCAGATT 
ATTACCGTTA TGACATTGTT 
GTCTGCCGAT GCATGCTGGT 
ATATTGATCA TATCCATGTC 
TCAAGTATGT GATGCAACAC 
AAGAGTCAGG TTCGGTCATT 
ACTGGCAAAT TATCCATTCT 
CAACACCAGA CGGCTATATT 
GGAAAGATGG CTCCTTTAGC 
AATCTGATCT ATCCCAAGCT 
CTGGTGATGC TACTGATACG 
AAAACCAACA GCCAAGTGAA 
GTTTACCAGA CTATGGTCTA 
AAAAAGCTAA TATCGATCCT 
ATAAAAATGG TGAATTGGTA 
(SBQ ID NO : 80) 



GTTGCTGCCA 
GGTCTAGCAA 
GCCCCTAAAA 
CAGATCGTAG 
TTTTACAATG 
CCTAATTACC 
AAAGTCAATG 
ACCAAACAAC 
TTAGCTCAAG 
CAAGGACGCT 
TTAGGAGATG 
TTGTCTCCAA 
GCTAGACCGT 
GATGTGACGC 
CCTCCTAGGC 
ACCTTTAAGG 
GAAGATGGGT 
GTGCCTCATO 
GAATTAGCAG 
TCAAAACCAT 
GGAAAAGGCT 
TCCATTCATT 
ATAGGATTTG 
AAAGGTCAAG 
CTCTTTGACA 
ATGCCAAAAG 
GCCTTTGCCG 
GACACAGGTA 
AATGCTACTT 
GTTCCGTATT 
CCCGAAGTTC 
CCAAATGTTA 
GCTGAAGAAG 
TTCGATCCAC 
ATCCCAAGAG 
GAGTGGCAAC 
GATAAACCCA 
GCCAGTAAAG 
GATAGAGGAA 
AAGTATCTCA 
ACTTATGATA 



TTTTACTAGC 
CAAAGGACAA 
CAAACAAAAC 
TCAAAATTAC 
GGAAAGTTCC 
GTTTTAAACA 
GCAACTATTA 
AAATTGCTGA 
TGGCCCATCT 
ATACTACAGA 
CTTATTTAGT 
GTGAGCTAGC 
CTGATTACCG 
CTAACCCTGO 
CAAATGATGC 
AACTTTTAGA 
TGATTTTTGA 
GAGATCATTA 
ATCGATACTT 
CAGATAAAGA 
TAGATGGTAA 
CAGTGGATAA 
GAGAACTTGA 
CTGATGAGCT 
CTAAAAAAGT 
ATGGTAAGGA 
AACAAGAACT 
TTGAGCCACG 
ACGATACTGG 
CATGGTTGAC 
GTCCGGATGT 
CGCCTCTTGA 
TTCAAAAAGC 
GAGATGTTTT 
GAGATGGCAG 
AAGCTCAAGA 
AAGAAAAGCA 
AAGAAAAAGA 
CCCTAGAAGA 
TTTTCCAACC 
TCAAGACACT 



TACTCATATT 
TCAGATTGCC 
GATGGATCAA 
TGACCAAGGC 
TTATGATGCG 
ATCAGACGTT 
TGTTTACCTC 
GCAAGTAGCC 
CAGTAAAGAA 
CGATGGCTAT 
ACCTCATGGT 
TGCTGCACAA 
CCCGACACCA 
ACAAGGTCAT 
GTCACAAAAC 
TCAACTACAC 
ACCGACTCAA 
TCATATTATC 
AGCTGGCCAA 
AGTGACACAT 
ACCATATGAT 
ATCAGGAGTT 
ACAATATGAG 
TGCTGCTGCT 
GAGTCGCAAA 
CTATTTCTAT 
AATGCTTAAA 
ACTTGCTGTA 
AAGTTCGTTT 
GCGCGATCAG 
ATGGTCTAAG 
TAAACGTGCT 
CCTAGCAGAA 
GGCCAAAGAA 
TTCATTGAGA 
GTTATTGGCA 
ACAGGCAGAT 
ATCAGATGAC 
TCATATCAAT 
AGAAGGTGTC 
TCAACAAATA 
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VKKTYGYIGS VAAILLATHI GSYQLGKHHM GLATKDNQIA YIDDSKQKAK 

"£f EGISAB a*™"** wrsSSi 

IISEELLMTD PNYRFKQSDV INEILDOYVI KVNGNYYVYL KPGSKPKNTP 
^«^?n A KGTKEAKEKG ^QVAHLSKE WaSSEE 
«f^ IIDD LQDAYLV PHG NHYHYIPKKD LSPSELAAAQ AYWSoSgRG 
ARPSDYRPTP APGRRKAPIP DVTPNPGQGH QPDNGGYHPA WRpSSfoN 
£2552™ TFKELLD Q L " RLDLKYRHVE EDGLIFeSJ JSSSSSJ 
ZEESS ELADRYIAGQ TEDNDSGSEH SEES 

GKGLDGKPYD TSDAYVFSKE SIHSVDKSGV TAKHGDHFHY 
QYE "^"VKA KGQADELAAA LDQEOgSS ^S!sS 
VTKDGKVGYM MPKDGKDYFY ARDQLDLTQI AFAEOELMLK n^wv^n! 

225?™ dvsslpmhaq hatydtgSf 0S2ES ££££££ 

^^T QH PEVRPDV "SK PQHEESGSVI PNVTPLDKRA GmSS 
AEEVQKALAE GRFATPDGYr FDPRDVLAKE TFVWKDGSFS IPRAMsS 

KK^DATDT DKPKEKQQAD SeSe 

ssss? sssss sins sr^,™-* r 
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GTGAAQAAAA 
GQAAGTTACC 
TATATTGATQ 
ATCAGTGCTG 
TATGTGACCT 
ATTATTAQTG 
ATCAATGAAA 
AAGCCAGGTA 
AAAGGAACTA 
GAAGTTGCGG 
ATTTTTAGTC 
AATCACTATC 
GCTTACTGGA 
GCCCCAGGTC 
CAGCCAGATA 
AAACACCAAA 
CGTCTTGATT 
GTGATCAAAT 
CCAAGAAGTC 
ACTGAGGACA 
ACCTTTCTTG 
ACGAGTGATG 
ACAGCTAAAC 
TTGGATGAGG 
TTGGATCAGG 
GTAACAAAAG 
GCTCGTGATC 
GATAAGAACC 
GATGTGTCAA 
GTTATCCCTC 
ATTGCAACAA 
CCAGGGCATG 
GGTATGCCAA 
GGTCGTTTTG 
ACTTTTGTAT 
ACCATTAATA 
AAGAAAAACG 
AAGAGCAATG 
GACTTTATAG 
AATCAATTAG 
GTCCAATTTT 
ATAAACCCTT 



CATATGGTTA TATCGGCTCA 
AACTTGGTAA GCATCATATG 
ATAGCAAAGG TAAGGCAAAA 
AAGAAGGCAT CTCTGCTGAA 
CACACGGTGA CCATTATCAT 
AAGAGTTGTT GATGACGGAT 
TCTTAGACGG TTACGTTATT 
GTAAGCGCAA AAACATTCGA 
AAGAAGCTAA AGAAAAAGGT 
CAGTCAATGA AGCAAAAAGA 
CGACAGATAT CATTGATGAT 
ATTATATTCC TAAAAAAGAT 
GTCAAAAACA AGGTCGAGGT 
GTAGGAAAGC TCCAATTCCT 
ACGGTGGCTA TCATCCAGCG 
GAGATGAGTT TAAAGGAAAA 
TGAAATACCG TCATGTGGAA 
CAAACGCTTT TGGGTATGTG 
AGTTATCACC TCTTGAAATG 
ATGATTCAGG TTCAGATCAC 
GTCATCGCAT CAAAGCTTAC 
CTTATGTTTT TAGTAAAGAA 
ACGGAGATCA TTTCCACTAT 
TCGCTAACTG GGTGAAAGCA 
AACAAGGCAA AGAAAAACCA 
ATGGTAAAGT GGGCTATATT 
AACTTGATTT GACTCAGATT 
ATTACCGTTA TGACATTGTT 
GTCTGCCGAT GCATGCTGGT 
ATATTGATCA TATCCATGTC 
TCAAGTATGT GATGCAACAC 
AAGAGTCAGG TTCGGTCATT 
ATTGGCAAAT CATCCATTCT 
CAACACCAGA CGGCTATATT 
GGAAAGATGG CTCCTTTAGC 
AATCTGATCT ATCCCAAGCT 
CTGGTGATGC TACTGATACG 
AAAACCAACA GCCAAGTGAA 
ACAGTTTACC AGACTATGGT 
CACAAAAAGC TAATATCGAT 
ATAATAAAAA TGGTGAATTA 
AA (SEQ ID NO : 82) 



GTTGCTGCCA 
GGTCTAGCAA 
GCCCCTAAAA 
CAGATCGTAG 
TTTTACAATG 
CCTAATTACC 
AAAGTCAATG 
ACCAAACAAC 
TTAGCTCAAG 
CAAGGACGCT 
TTAGGAGACG 
TTGTCTCCAA 
GCTAGACCGT 
GATGTGACGC 
CCTCCTAGGC 
ACCTTTAAGG 
GAAGATGGGT 
GTGCCTCATG 
GAATTAGCAG 
TCAAAACCAT 
GGAAAAGGCT 
TCCATTCATT 
ATAGGATTTG 
AAAGGTCAAG 
CTCTTTGACA 
ATGCCAAAAG 
GCCTTTGCCG 
GACACAGGTA 
AATGCTACTT 
GTTCCGTATT 
CCCGAAGTTC 
CCAAATGTTA 
GCTGAAGAAG 
TTCGATCCAC 
ATCCCAAGAG 
GAGTGGCAAC 
GATAAACCCA 
GCCAGTAAAG 
CTAGATAGAG 
CCTAAGTATC 
GTAACTTATG 



TTTTACTAGC 
CAAAGGACAA 
CAAACAAAAC 
TCAAAATTAC 
GGAAAGTTCC 
ATTTTAAACA 
GCAACTATTA 
AAATTGCTGA 
TGGCCCATCT 
ATACTACAGA 
CTTATTTAGT 
GTGAGCTAGC 
CTGATTACCG 
CTAACCCTGG 
CAAATGATGC 
AACTTTTAGA 
TGATTTTTGA 
GAGATCATTA 
ATCGATACTT 
CAGATAAAGA 
TAGATGGTAA 
CAGTGGATAA 
GAGAACTTGA 
CTGATGAGCT 
CTAAAAAAGT 
ATGGCAAGGA 
AACAAGAACT 
TTGAGCCACG 
ACGATACTGG 
CATGGTTGAC 
GTCCAGATGT 
CGCCTCTTGA 
TTCAAAAAGC 
GAGATGTTTT 
CAGATGGCAG 
AAGCTCAAGA 
AAGAAAAGCA 
AAGAAGAAAA 
CAACCCTAGA 
TCATTTTCCA 
ATATCAAGAC 



TACTCATATT 
TCAGATTGCC 
GATGGATCAA 
TGACCAAGGT 
TTATGATGCG 
ATCAGACGTT 
TGTTTACCTC 
GCAAGTAGCC 
CAGTAAAGAA 
CGATGGCTAT 
ACCTCATGGT 
TGCTGCACAA 
CCCGACACCA 
ACAAGGTCAT 
GTCACAAAAC 
TCAACTACAC 
ACCGACTCAA 
TCATATTATC 
AGCCGGTCAA 
AGTGACACAT 
ACCATATGAT 
ATCAGGAGTT 
ACAATATGAG 
TGCTGCTGCT 
GAGTCGCAAA 
CTATTTCTAT 
AATGCTTAAA 
ACTTGCTGTA 
AAGTTCGTTT 
GCGCGATCAG 
ATGGTCTAAG 
TAAACGTGCT 
CCTAGCAGAA 
GGCCAAAGAA 
TTCATTGAGA 
GTTATTGGCA 
ACAGGCAGAT 
AGAATCAGAT 
AGATCATATC 
ACCAGAAGGT 
GCTTCAACAA 
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ApSS SISS™ 1 GSY « WI «HM OLATKDNQIA YIDDSKGKAK 
S2S JS5SS; SiSSffS" 00 VVTSHOD » y « FYNGKVPYDA 
TKQQIAEQVA SX222£ f m KVNGNYYVYL KPGSKRKNIR 

IFSPTDTTnn fXlP^ LAQVAHLSKE EVAAVNEAKR QGRYTTDDGY 

ssssss tsssssss sir sss 

KHQRDEFKGK tJSEESi J ^ GQ ° H a pDN <3°™PA PPRPNDASQN 
vpSgdhyJX SSSSS E ^"FEPTQ VIKSNAFGYV 

TFLGHRIKAY QKQI npvovn fLADRYLAGQ TEDNDSGSDH SKPSDKEVTH 
SeSS SSSEf f 1HSVDKSGV ™HGDHFHY 
VTKDGKVQYI MPKDGmvw ^° ADELAAA LDQEQGKEKP LFDTKKVSRK 
dtSprSJ ^8 LDLTQI ^QELMLK DKNHYRYDIV 

IATIKyS^ PeSdSSk S22XJ VIPHIDHIHV VPYSWLTRDQ 
AEEVQKAIAE GREATPMy? SSSS?f ^ PNVTPLDKRA GMPNWQIIHS 
TINKSDLSQA TFVWKDGSFS IPRADGSSLR 

ASKEEEKESD DFIDST Drwn ^ AGDM ^ DKPKEKQQAD KSNENQQPSE 
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